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CG3842 homologous proteins involved in the regulation of energy 
homeostasis 

Description 

This invention relates to the use of nucleic acid sequences encoding 
CG3842 or SCAD homologous proteins, and the polypeptides encoded 
thereby and to the use thereof or effector molecules of CG3842 or SCAD 
homologous nucleic acids or polypeptides in the diagnosis, study, 
prevention, and treatment of diseases and disorders related to body-weight 
regulation, for example, but not limited to, metabolic diseases such as 
obesity as well as related disorders such as metabolic syndrome eating 
disorder, cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, and/or gallstones. 

There are several metabolic diseases of human and animal metabolism, eg., 
obesity and severe weight loss, that relate to energy imbalance where 
caloric intake versus energy expenditure is imbalanced. Obesity is one of 
the most prevalent metabolic disorders in the world. It is still poorly 
understood human disease that becomes more and more relevant for 
western society. Obesity is defined as an excess of body fat, frequently 
resulting in a significant impairment of health. Cardiovascular risk factors 
like hypertension, high blood levels of triglycerides and fasting glucose as 
well as low blood levels of HDL cholesterol are often linked to obesity. This 
typical cluster of symptoms is commonly defined as "metabolic syndrome" 
(Reaven, 2002, Circulation 106(3): 286-8). Hyperlipidemia and elevation of 
free fatty acids correlate clearly with the metabolic syndrome, which is 
considered to be the linkage between several diseases, including obesity 
and insulin resistance. This often occurs in the same patients and is a 
major risk factor for development of Type 2 diabetes and cardiovascular 
disease. It was suggested that the control of lipid levels and glucose levels 



WO 03/074078 




IPCTYEP03/02363 



- 2 - 

is required to treat Type 2 Diabetes, heart disease, and other occurances of 
metabolic syndrome (see, for example, Santomauro A. T. et al., (1999) 
Diabetes, 48(9): 1 836-1 841 , McCook, 2002, JAMA 288:2709-2716). 

Human obesity is strongly influenced by environmental and genetic factors, 
whereby the environmental influence is often a hurdle for the identification 
of (human) obesity genes. Obesity is influenced by genetic, metabolic, 
biochemical, psychological, and behavioral factors. As such, it is a complex 
disorder that must be addressed on several fronts to achieve lasting 
positive clinical outcome. 

Obesity is not to be considered as a single disorder but a heterogeneous 
group of conditions with (potential) multiple causes. Obesity is also 
characterized by elevated fasting plasma insulin and an exaggerated insulin 
response to oral glucose intake (Koltermann, 1980, J. Clin. Invest 
65:1272-1284). A clear involvement of obesity in type 2 diabetes mellitus 
can be confirmed (Kopelman, 2000, Nature 404:635-643). 

Insulin amongst other hormones plays a key role in the regulation of the 
fuel metabolism. High blood glucose levels stimulate the secretion of 
insulin by pancreatic beta-cells. Insulin leads to the storage of glycogen 
and triglycerides and to the synthesis of proteins. The entry of glucose into 
muscles and adipose cells is stimulated by insulin. In patients who suffer 
from diabetes mellitus either the amount of insulin produced by the 
pancreatic islet cells is to low (Diabetes Type 1 or insulin dependent 
diabetes mellitus, IDDM) or liver and muscle cells loose their ability to 
respond to normal blood insulin levels (insulin resistance). In the next stage 
pancreatic cells become unable to produce sufficient amounts of insulin 
(Diabetes Type II or non insulin dependent diabetes mellitus NIDDM). 

Even if several candidate genes have been described which are supposed 
to influence the homeostatic system(s) that regulate body mass/weight, 
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like leptin, VCPI, VCPL, or the peroxisome proliferator-activated 
receptor-gamma co-activator, the distinct molecular mechanisms and/or 
molecules influencing obesity or body weight/body mass (and thus 
metabolic syndrome) regulations are not known. 

Therefore, the technical problem underlying the present invention was to 
provide for means and methods for modulating (pathological) metabolic 
conditions influencing body-weight regulation and/or energy homeostatic 
circuits. The solution to said technical problem is achieved by providing the 
embodiments characterized in the claims. 

Accordingly, the present invention relates to genes with novel functions in 
body-weight regulation, energy homeostasis, metabolism, and obesity. The 
present invention discloses a specific gene involved in the regulation of 
body-weight, energy homeostasis, metabolism, and obesity, and thus in 
disorders related thereto such as metabolic syndrome, eating disorder, 
cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, and gallstones. The 
present invention describes the human homologs of the Drosophila 
CG3842 gene as being involved in those conditions mentioned above. 

The acyl-CoA dehydrogenase (Acad or ACAD) gene family of enzymes 
includes very-long-chain (VLCAD), medium-chain (MCAD), and short-chain 
(SCAD) acyl-CoA dehydrogenases. The short-chain dehydrogenases/ 
reductases family (SDR) constitute a large and diverse family of enzymes 
of ancient origin. Several of its members play an important role in human 
physiology and disease, especially in the metabolism of steroid substrates 
(e.g., prostaglandins, estrogens, retinoids, androgens, and corticosteroids). 
Their involvement in common human disorders such as endocrine-related 
cancer, osteoporosis, and Alzheimer disease makes them important 
candidates for drug targets. 



ECT/EP03/02363 

WO 03/074078 

- 4 - 

The Drosophila gene of this invention (GadFly Accession Number CG3842) 
has one aminotermina. transmembrane domain and to a large extend 
exhibits a predicted secondary structure motif characteristic of short-cha.n 
alcohol dehydrogenases (adh-short motif; e.g., from amino acid 73 to 
amino acid 328 in the protein of 406 amino acids length). Three human 
homologous proteins were identified in this invention. These prote.ns are 
the unnamed protein XP_085058 (BAB7081 1 , also referred to as DG21-1 
herein) and CGI-82 (also referred to as prostate short-chain dehydrogenase 
reductase 1, PSDR 1, or DG21-2 herein), and also PAN2. 

The human PSDR1 (CGI-82) gene was identified . by comparative genomics 
(Lai etal 2000, Genome Res 10(5):703-71 3). PSDR1 is highly expressed 
in the prostate gland and a function in the androgen receptor-regulated 
gene network of the human prostate was suggested. Genes regulated by 
androgenic hormones are of critical importance for the norma, physio.og.ca. 
function of the human prostate gland, and they contribute to the 
development and progression of prostate carcinoma (Lin et al., 2001, 
Cancer Res 61 (4):1 61 1-1618). 

Human BAB70811 (unnamed protein XP.085058) cDNA was isolated as 
part of a sequencing project from human brain (cerebre.lum) tissue. No 
functional data were available at the time the invention was made. 

The human PAN 2 protein has been submitted to the NCBI Genbank 
recently (GenBank Accession Number NP_065965; submitted February 1 0, 
2002 by Brereton et al.). PAN2 has been described as member of the 
SCAD superfamily. 

So far it has not been described that CG3842 encoded protein and closely 
, re.ated proteins, particularly human proteins unnamed protein XP_085058, 
CGI-82 and PAN2, are involved in the regulation of energy homeostas.s 
and body-weight regulation and related disorders, and thus, no functions in 
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metabolic diseases and other diseases as listed above have been 
discussed. 

In this invention we demonstrate that the correct gene dose of CG3842 is 
essential for maintenance of energy homeostasis. A genetic screen was 
used to identify that mutation of a CG3842 homologous gene causes 
obesity, reflected by a significant increase of triglyceride content, the major 
energy storage substance. 

Polynucleotides encoding a protein with homologies to CG3842 are 
suitable to investigate diseases and disorders as described above. Further 
new compositions useful in diagnosis, treatment, and prognosis of diseases 
and disorders as described above are provided. 

Before the present proteins, nucleotide sequences, and methods are 
described, it is understood that this invention is not limited to the particular 
methodology, protocols, cell lines, vectors, and reagents described as 
these may vary. It is also to be understood that the terminology used 
herein is for the purpose of describing particular embodiments only, and is 
not intended to limit the scope of the present invention which will be 
limited only by the appended claims. Unless defined otherwise, all technical 
and scientific terms used herein have the same meanings as commonly 
understood by one of ordinary skill in the art to which this invention 
belongs. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present 
invention, the preferred methods, devices, and materials are now 
described. All publications mentioned herein are incorporated herein by 
reference for the purpose of describing and disclosing the cell lines, 
vectors, and methodologies which are reported in the publications which 
might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate 
such disclosure. 
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The present invention discloses that CG3842 homologous proteins are 
regulating the energy homeostasis and fat metabolism especially the 
metabolism and storage of triglycerides, and polynucleotides, which 
identify and encode the proteins disclosed in this invention. The invention 
also relates to vectors, host cells, antibodies, and recombinant methods for 
producing the polypeptides and polynucleotides of the invention. The 
invention also relates to the use of these polynucleotides, polypeptides and 
effectors thereof, e.g. antibodies, aptamers, anti-sense molecules, 
ribozymes or other receptors recognizing a nucleic acid molecule or 
polypeptide homologous to CG3842 in the diagnosis, study, prevention, 
and treatment of diseases and disorders, for example, but not limited to, 
metabolic diseases such as obesity as well as related disorders such as 
metabolic syndrome, eating disorder, cachexia, diabetes mellitus, 
hypertension, coronary heart disease, hypercholesterolemia, dyslipidemia, 
osteoarthritis and/or gallstones. 

The term "polynucleotide comprising the nucleotide sequence as shown in 
GenBank Accession number" relates to the expressible gene of the 
nucleotide sequences deposited under the corresponding GenBank 
Accession number. The term "GenBank Accession Number" relates to NCBI 
GenBank database entries (Ref.: Benson et al., (2000) Nucleic Acids Res. 
28: 15-18). 

CG3842 homologous proteins and nucleic acid molecules coding therefore 
are obtainable from insect or vertebrate species, e.g. mammals or birds. 
Particularly preferred are human homologous nucleic acids, particularly 
nucleic acids encoding a human unnamed protein, a human CGI-82 protein, 
or PAN2 protein. 

The invention particularly relates to a nucleic acid molecule encoding a 
polypeptide contributing to regulating the energy homeostasis and the 
metabolism of triglycerides, wherein said nucleic acid molecule comprises 
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(a) the nucleotide sequence of or a nucleotide sequence encoding an 
unnamed protein (SEQ ID NO: 1; GenBank Accession Number 
XM_085058), human CGI-82 (SEQ ID NO: 3; GenBank Accession 
Number NM_01 6026), or PAN 2 (GenBank Accession Number 
NM 020905), or GadFly Accession Number CG3842 and/or a 
sequence complementary thereto, 

(b) a nucleotide sequence which hybridizes at 50°C in a solution 
containing 1 x SSC and 0.1 % SDS to a sequence of (a), 

(c) a sequence corresponding to the sequences of (a) or (b) within the 
degeneration of the genetic code, 

(d) a sequence which encodes a polypeptide which is at least 85%, 
preferably at least 90%, more preferably at least 95%, more 
preferably at least 98% and up to 99,6% identical to the amino acid 
sequence of CG3842 homologous proteins, 

(e) a sequence encoding a CG3842 homologous protein, preferably a 
human CG3842 homologous protein unnamed protein' with SEQ ID 
NO: 2; GenBank Accession Number XPJ385058), CGI-82 protein 
(SEQ ID NO: 4; GenBank Accession Number NPJ3571 1 0), or PAN2 
(GenBank Accession Number NP_065956), and/or a sequence 
complementary thereto, 

(f) a sequence which differs from the nucleic acid molecule of (a) to (d) 
by mutation and wherein said mutation causes an alteration, 
deletion, duplication and/or premature stop in the encoded 
polypeptide or 

(g) a partial sequence of any of the nucleotide sequences of (a) to (e) 
having a length of at least 1 5 bases, preferably at least 20 bases, 
more preferably at least 25 bases and most preferably at least 50 
bases. 

The invention is based on the finding that CG3842 homologous proteins, 
particularly proteins of the SCAD family as defined above (herein referred 
to as CG3842 or CG3842 homologous proteins), and the polynucleotides 
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encoding these are involved in the regulation of triglyceride storage and 
therefore energy homeostasis. The invention describes the use of these 
compositions for the diagnosis, study, prevention, or treatment of diseases 
and disorders related thereto, including metabolic syndrome. 

Accordingly, the present invention relates to genes with novel functions in 
body-weight regulation, energy homeostasis, metabolism, and obesity, 
fragments of said genes, polypeptides encoded by said genes or fragments 
thereof, and effectors e.g. antibodies, biologically active nucleic acids, 
such as antisense molecules, RNAi molecules or ribozymes, aptamers, 
peptides or low-molecular weight organic compounds recognizing said 
polynucleotides or polypeptides. 

The ability to manipulate and screen the genomes of model organisms such 
as the fly Drosophila melanogaster provides a powerful tool to analyze 
biological and biochemical processes that have direct relevance to more 
complex vertebrate organisms due to significant evolutionary conservation 
of genes, cellular processes, and pathways (see, for example, Adams M. 
D. et al., (2000) Science 287: 2185-2195). Identification of novel gene 
functions in model organisms can directly contribute to the elucidation of 
correlative pathways in mammals (humans) and of methods of modulating 
them. A correlation between a pathology model (such as changes in 
triglyceride levels as indication for metabolic syndrome including obesity) 
and the modified expression of a fly gene can identify the association of 
the human ortholog with the particular human disease. In one embodiment, 
a forward genetic screen is performed in fly displaying a mutant phenotype 
due to misexpression of a known gene (see, Johnston Nat Rev Genet 3: 
176-188 (2002); Rorth P., (1996) Proc Natl Acad Sci U S A 93: 
12418-12422). 

Triglycerides are the most efficient storage for energy in cells, and obese 
people mainly show a significant increase in the content of triglycerides. In 
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this invention, we have used a genetic screen to identify mutations that 
cause changes in the body weight which is reflected by a significant 
change of triglyceride levels. To isolate genes with a function in energy 
homeostasis, several thousand EP-lines were tested for their triglyceride 
content after a prolonged feeding period (see Examples for more detail). 
Lines with significantly changed triglyceride content were selected as 
positive candidates for further analysis. The change of triglyceride content 
due to the loss of a gene function suggests gene activities in energy 
homeostasis in a dose dependent manner that control the amount of 
energy stored as triglycerides. 

A resource for screening was a proprietary Drosophila melanogaster stock 
collection of PX-lines. The P-vector of this collection has Gal4-UAS-binding 
sites fused to a basal promoter that can transcribe adjacent genomic 
Drosophila sequences upon binding of Gal4 to UAS-sites. This enables the 
PX-line collection for overexpression of endogenous flanking gene 
sequences. In addition, without activation of the UAS-sites, integration of 
the EP-element into the gene is likely to cause a reduction of gene activity, 
and allows determining its function by evaluating the loss-of-function 
phenotype. 

In one embodiment, flies homozygous for the integration of vectors for 
Drosophila line PX2287.1 were analyzed in an assay measuring the 
triglyceride contents of these flies, illustrated in more detail in the 
EXAMPLES section of the invention. The result of the triglyceride content 
analysis is shown in FIGURE 1 . The average triglyceride level of the fly 
collection in which the PX2287.1 line was found is shown as 100% in 
FIGURE 1 (First column, "TG010419, n = 60 n ). The average increase of 
triglyceride content of the homozygous viable Drosophila line PX2287.1 is 
80% (see FIGURE 1, second column, line "2287.1"). It was found in this 
invention that homozygous PX2287.1 flies have a significant higher 
triglyceride content than the control flies tested. The increase of 
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triglyceride content due to the loss of a gene function suggests gene 
activities in energy homeostasis in a dose dependent manner that controls 
the amount of energy stored as triglycerides. 

Nucleic acids encoding the Drosophila GadFly Accession Number CG3842 
homologous proteins of the present invention were identified using an iPCR 
technique. Genomic DNA sequences were isolated that are localized 
adjacent to the EP vector (herein PX2287.1) integration. Using those 
isolated genomic sequences public databases like Berkeley Drosophila 
Genome Project (GadFly; see also FlyBase (1999) Nucleic Acids Research 
27:85-88) or GenBank (NCBI) were screened thereby confirming the 
homozygous viable integration site of the PX2287.1 vector 542 base pairs 
downstream of the coding seqence of a gene, identified as Berkeley 
Drosophila Genome Project Accession Nr. CG3842 (FIGURE 2). FIGURE 2 
shows the molecular organization of this gene locus. 

Therefore, expression of the cDNA encoding GadFly Accession Number 
CG3842 could be effected by homozygous viable integration of vectors of 
line PX2287.1, leading to increase of the energy storage triglycerides. 

The Drosophila genes and proteins encoded thereby with functions in the 
regulation of triglyceride metabolism were further analysed in publicly 
available sequence databases (see EXAMPLES for more detail) and 
mammalian homologs were identified (see FIGURE 3). 

The present invention is further describing a polypeptide comprising the 
amino acid sequence of CG3842. A comparison (Clustal X (1 .81) analysis) 
between the CG3842 proteins of different species (human and Drosophila) 
was conducted (see FIGURE 4). Based upon homology, CG3842 protein of 
the invention and each homologous protein or peptide may share at least 
some activity- No functional data described the regulation of body weight 
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control and related metabolic diseases such as obesity and diabetes are 
available in the prior art for the genes of the invention. 

The function of the mammalian homologs in energy homeostasis was 
further validated in this invention by analyzing the expression of the 
transcripts in different tissues and by analyzing the role in adipocyte 
differentiation. Expression profiling studies (see EXAMPLES for more detail) 
confirm the particular relevance of the protein(s) of the invention as 
regulators of energy metabolism in mammals. For example, transcripts of 
unnamed protein (DG21-1) are more restricted to spleen, bone marrow and 
lung of mammals (FIGURE 6A). CGI-82 (DG21-2) transcripts show highest 
expression in testis (FIGURE 7A). In addition, the proteins of the invention 
are also clearly expressed in white adipose tissue (WAT); unnamed protein 
shows high levels of expression especially in WAT (see FIGURE 6A). Thus 
an expression of the protein of the invention in adipose tissues is 
confirming a role in the regulation of energy homeostasis and 
thermogenesis. 

Further, we show that the proteins of the invention are regulated by fasting 
and by genetically induced obesity, in this invention, we used mouse 
models of insulin resistance and/or diabetes, such as mice carrying gene 
knockouts in the leptin pathway (for example, ob (leptin) or db (leptin 
receptor) mice) to study the expression of the protein of the invention. 
Such mice develop typical symptoms of diabetes, show hepatic lipid 
accumulation and frequently have increased plasma lipid levels (see 
Bruning et al, 1998, Mol. Cell. 2:449-569). We found, for example, that 
the expression of unnamed protein (DG21-1) is strongly upregulated 
especially in the pancreas of fasted mice (see FIGURE 6B). The expression 
of CGI-82 (DG21-2) is strongly upregulated in spleen and kidney of fasted 
mice (FIGURE 7B). In addition, a marked upregulation of DG21-1 and 
DG21-2 can be observed in the metabolically active tissue (for example, 
DG21-1 in brown adipose tissue (BAT) and DG21-2 in WAT) of genetically 
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obese (ob/ob) as well as of fasted mice (see FIGURE 6B, and FIGURE 7B, 
respectively). 

In addition, DG21-1 (unnamed protein) mRNA is upregulated in pancreas 
and BAT of mice with symptoms of diabetes, lipid accumulation, and high 
plasma lipid levels, if fed a high fat diet (HFD) (FIGURE 6C). DG-21-2 
(homolog to CGI-82 = PSDR1) is strongly up regulated in white adipose 
tissue (WAT) of ob/ob mice compared to wt mice (FIGURE 7B) and also in 
BAT and muscle of mice fed a HFD compared to mice fed a standard diet 
(FIGURE 7C). These results provide evidence that the proteins of the 
invention are modulators of adipogenesis. 

In addition, we show in this invention that the mRNA of unknown protein 
(DG21-1) is significantly up-regulated during adipocyte differentiation in 
vitro (see EXAMPLES for more detail, see FIGURE 6D, FIGURE 6E, FIGURE 
6F), suggesting a role as modulator of adipocyte lipid accumulation. Thus, 
we conclude that the protein of the invention (or variants thereof) have a 
function in the metabolism of mature mammalian adipocytes. 

Glucose is taken up by the cells rapidly and stored in the form of glycogen 
primarily used for the metabolic demands of the cell. An increase in cellular 
glycogen levels as a consequence of overexpression of unnamed protein 
XP085058 (see FIGURE 8A) could reflect an elevated glucose uptake, a 
higher glycogen synthesis rate or a decreased energy consumption and 
thus confirms a role of unnamed protein XP_085058 in metabolic 
regulation. 

In addition, we show in this invention that cells overexpressing unnamed 
protein XPJD85058 are more effective in synthesising lipids from 
exogenous glucose (see FIGURE 8E). Consequently, levels of insulin 
stimulated lipid synthesis of cells overexpressing unnamed protein 
XP 085058 are also increased. The increased level of lipid synthesis from 
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exogenous glucose in cells overexpressing unnamed protein XP_085058 
demonstrate an activation of enzymes involved in lipid synthesis. 

The profound increase in uptake of free fatty acids of cells overexpressing 
unnamed protein XP_085058 (see FIGURE 8C) and the decrease of the free 
fatty acid esterification in CGI-82 overexpressing cells (see FIGURE 8D) 
could be due to a direct action of the proteins of the invention or 
alternatively due to a role in the regulation of free fatty acid uptake and 
esterification. The proteins of the invention may for example play a role in 
converting regulatory inactive retinoids or steroids in regulatory active 
hormone derivatives which then influence fatty acid metabolism directly or 
regulate the gene expression of fatty acid metabolic enzymes or 
transporters on the transcriptional level. 

The invention also encompasses polynucleotides that encode CG3842 and 
homologous proteins. Accordingly, any nucleic acid sequence, which 
encodes the amino acid sequences of CG3842 homologous proteins, can 
be used to generate recombinant molecules that express CG3842 
homologous proteins. It will be appreciated by those skilled in the art that 
as a result of the degeneracy of the genetic code, a multitude of nucleotide 
sequences encoding CG3842 homologous proteins, some bearing minimal 
homology to the nucleotide sequences of any known and naturally 
occurring gene, may be produced. Thus, the invention contemplates each 
and every possible variation of nucleotide sequence that could be made by 
selecting combinations based on possible codon choices. 

Also encompassed by the invention are polynucleotide sequences that are 
capable of hybridizing to the claimed nucleotide sequences, and in 
particular, those of the polynucleotides comprising the nucleic acid 
sequence encoding a Drosophila protein (GadFly Accession Number 
CG3842) and homologous human proteins. Hybridization conditions are 
based on the melting temperature (Tm) of the nucleic acid binding complex 
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or probe, as taught in Wahl, G. M. and S. L. Berger (1987: Methods 
Enzymol. 152:399-407) and Kimmel, A. R. (1987; Methods EnzymoL 
152:507-511), and may be used at a defined stringency. Preferably, 
hybridization under stringent conditions means that after washing for 1 h 
with 1 x SSC and 0.1% SDS at 50°C, preferably at 55 °C, more preferably 
at 62°C and most preferably at 68°C, particularly for 1 h in 0.2 x SSC and 
0.1 % SDS at 50°C, preferably at 55°C, more preferably at 62°C and most 
preferably at 68°C, a positive hybridization signal is observed. Altered 
nucleic acid sequences encoding CG3842, which are encompassed by the 
invention include deletions, insertions, or substitutions of different 
nucleotides resulting in a polynucleotide that -encodes the same or a 
functionally equivalent CG3842 homologous proteins. 

The encoded proteins may also contain deletions, insertions, or 
substitutions of amino acid residues, which produce a silent change and 
result in a functionally equivalent CG3842 homologous proteins. Deliberate 
amino acid substitutions may be made on the basis of similarity in polarity, 
charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues as long as the biological activity of CG3842 
homologous proteins is retained. Furthermore, the invention relates to 
peptide fragments of the proteins or derivatives thereof such as cyclic 
peptides, retro-inverso peptides or peptide mimetics having a length of at 
least 4, preferably at least 6 and up to 50 amino acids. 

Also included within the scope of the present invention are alleles of the 
genes encoding CG3842 homologous proteins. As used herein, an "allele" 
or "allelic sequence" is an alternative form of the gene, which may result 
from at least one mutation in the nucleic acid sequence. Alleles may result 
in altered mRNAs or polypeptides whose structures or function may or may 
not be altered. Any given gene may have none, one, or many allelic forms. 
Common mutational changes, which give rise to alleles, are generally 
ascribed to natural deletions, additions, or substitutions of nucleotides. 
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Each of these types of changes may occur alone, or in combination with 
the others, one or more times in a given sequence. 

The nucleic acid sequences encoding CG3842 homologous proteins may 
be extended utilizing a partial nucleotide sequence and employing various 
methods known in the art to detect upstream sequences such as 
promoters and regulatory elements. For example, one method which may 
be employed, "restriction-site" PCR, uses universal primers to retrieve 
unknown sequence adjacent to a known locus (Sarkar, G. (1993) PCR 
Methods Applic. 2:318-322). Inverse PCR may also be used to amplify or 
extend sequences using divergent primers based on a known region 
(Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186). Another method 
which may be used is capture PCR which involves PCR amplification of 
DNA fragments adjacent to a known sequence in human and yeast artificial 
chromosome DNA (Lagerstrom, M. et al. (PCR Methods Applic. 
1:111-119). Another method, which may be used to retrieve unknown 
sequences is that of Parker, J. D. et al. (1991; Nucleic Acids Res. 
19:3055-3060). Additionally, one may use PCR, nested primers, and 
PROMOTERFINDER libraries to walk in genomic DNA (Clontech, Palo Alto, 
Calif.). This process avoids the need to screen libraries and is useful in 
finding intron/exon junctions. 

In order to express a biologically active CG3842 homologous proteins, the 
nucleotide sequences encoding CG3842 homologous proteins or functional 
equivalents, optionally fused to heterologous sequences may be inserted 
into appropriate expression vectors, i.e., a vector, which contains the 
necessary elements for the transcription and translation of the inserted 
coding sequence. Methods, which are well known to those skilled in the 
art, may be used to construct expression vectors containing sequences 
encoding CG3842 homologous proteins and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant 
DNA techniques, synthetic techniques, and in vivo genetic recombination. 
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Such techniques are described in Sambrook, J. et al. (1989) Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., 
and Ausubel, F. M. et al. (1989) Current Protocols in Molecular Biology, 
John Wiley & Sons, New York, N.Y. 

A variety of expression vector/host systems may be utilized to contain and 
express sequences encoding CG3842 homologous proteins. These include, 
but are not limited to, micro-organisms such as bacteria transformed with 
recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; 
yeast transformed with yeast expression vectors; insect cell systems 
infected with virus expression vectors (e.g. r baculovirus); plant cell 
systems transformed with virus expression vectors (e.g., cauliflower 
mosaic virus, CaMV; tobacco mosaic virus, TMV) or with bacterial 
expression vectors (e.g., Ti or PBR322 plasmids); or animal cell systems. 

The presence of polynucleotide sequences encoding CG3842 homologous 
proteins can be detected by DNA-DNA or DNA-RNA hybridization and/or 
amplification using probes or portions or fragments of polynucleotides 
specific for CG3842 homologous proteins. Nucleic acid amplification based 
assays involve the use of oligonucleotides or oligomers based on the 
sequences encoding CG3842 homologous proteins to detect transformants 
containing DNA or RNA encoding CG3842 homologous proteins. As used 
herein "oligonucleotides" or "oligomers" refer to a nucleic acid sequence of 
at least about 10 nucleotides and as many as about 60 nucleotides, 
preferably about 1 5 to 30 nucleotides, and more preferably about 20-25 
nucleotides, which can be used as a probe or amplimer. 

A variety of protocols for detecting and measuring the expression of 
CG3842 homologous proteins, using either polyclonal or monoclonal 
antibodies specific for the protein are known in the art. Examples include 
enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based 
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immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering epitopes on CG3842 homologous proteins is preferred, but 
a competitive binding assay may be employed. These and other assays are 
described, among other places, in Hampton, R. et al. (1990; Serological 
Methods, a Laboratory Manual, APS Press, St Paul, Minn.) and Maddox, D. 
E. et al. (1983; J. Exp. Med. 158:1211-1216). 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid 
assays. Means for producing labeled hybridization or PCR probes for 
detecting sequences related to polynucleotides encoding CG3842 
homologous proteins include oligo-labeling, nick translation, end-labeling or 
PCR amplification using a labeled nucleotide. 

Alternatively, the sequences encoding CG3842 homologous proteins, or 
any portions thereof may be cloned into a vector for the production of an 
mRNA probe. Such vectors are known in the art, are commercially 
available, and may be used to synthesize RNA probes in vitro by addition 
of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled 
nucleotides. These procedures may be conducted using a variety of 
commerciaily available kits (Pharmacia & Upjohn, (Kalamazoo, Mich.); 
Promega (Madison Wis.); and U.S. Biochemical Corp., (Cleveland, Ohio). 

Suitable reporter molecules or labels, which may be used, include 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 
agents as well as substrates, co-factors, inhibitors, magnetic particles, and 
the like. 

Host cells transformed with nucleotide sequences encoding CG3842 
homologous proteins may be cultured under conditions suitable for the 
expression and recovery of the protein from cell culture. The protein 
produced by a recombinant cell may be secreted or contained intracellular^ 
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depends on the scenes and,or the vector used. As wi» be understood 
by those of skill in the art, expression vectors containing polynucleotrdes 
which encode CG3842 homologous proteins may be designed to conta.n 
signal sequences, which direct secretion of CQ3842 homo.ogous proterns 
through a prokaryotic or eukaryotic cel, membrane. OtheyecombUran, 
oonstructions may be used to join sequences encodmg CG3842 
homologous proteins to nucleotide sequence encoding a poiypeptide 
domain, which will facilitate purification of soluble proteins. Such 
purification facilitating domains include, but are not limited to. metal 
cheating peptides such as histidine-tryptophan . modules that a low 
purification on immobilized metals, protein A domains that allow 
purification on immobilized immunoglobulin, and the domain u,,„zed ,n the 
FLAG extension/affinity purification system (Immunex Corp.. Seattle. 
Wash., The inclusion of cleaveble ,inker sequences such as those specific 
for Factor XA or Enterokinase Onvitrogen. San Diego. Calif.) between the 
purification domain and CG3842 homologous proteins may be used to 
facilitate purification. In addition to recombinant production, fragments of 
CG3842 homologous proteins may be produced by direct peptide synthes.s 
using solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc 
85-2149-2154). Protein synthesis may be performed using manual 
techniques or by automation. Automated synthesis may be achieved, for 
example, using Applied Biosystems 431 A peptide synthes.zer (Perk.n 
Elmer) Various fragments of CG3842 homologous proteins may be 
chemically synthesized separately end combined using chemical methods 
to produce the full length molecule. 

Diagnostics and Therapeutics 

The data disclosed in this invention show that the nucleic acids and 
, ptoteins of the invention and effector molecules thereof are useful ,n 
diagnostic and therapeutic applications implicated, for example but not 
IMted to. in metabolic disorders such as obesity as well as related 
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■ . . r rarhexia diabetes meilitus, 
l « eatnq disorder, cacnexia, 

disorders such as hypercho , e s,erolemia. dyslipidemia, 

hvDertension, coronary heart disease, nyw 

0 eTarthrms and,or ga.lstones. Henca. diagnostic and ,hera P eut,c us s o 

1 CG3842 homologous proteins nucleic acids and pro,e,ns of the 

L .or exemple bu, no, limited to, the following: (i. prote.n 
inV ent,on are r ™£ ^ (iffl antibody t3 ,get 

ZZ. drug —cytotoxic antlbodyK flv, diagno- 

L/orprognostic ma*er. (v) gene therapy ,ene 

w tftols an d (vii) tissue regeneration in vitro and 
S£2 ~ tissues and cel, types composing these tissues 

and cell types derived from these tissues). 

The nucleic acids and proteins of the invention are useful in diagnostic and 
hJrapeutic applications implicated in various applications as - 
1 , For example, but not limited to, cDNAs encoding the CG3842 
be ,ow. For example thejr human homo logues may be 

nroteins of the invention and particularly 

proteins or cG384 2 proteins of the .nvent.on and 

useful in gene therapy, and the v 

p Iu,ar,V their humen homologues may be useful when 

subect ^ need thereof. By way of non-limiting example, he 
a suDjeut ^ffiracv for treatment of 

— -rrt . — 

disorders as described above. 

The novel nuoleio aoid encoding the CG3842 homologous proteins o, the 
2 o, fragments thereof, may further be useful in d.sgnost.c 
• ZZ on « -in the presence or amount o, the nucleic acUfs or the 
.he assessed These materials are further useful ,n the 
rirof MX - bind immunospecifically to the nova, 
substances o, the invention for use In therapeutic or diagnost.c methods. 

" for example, in one aspect, antibodies the, are specific for 003842 
homologous proteins may be used diredy as en an,agon,s, or ,nd,rec,,y 
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a targeting o, delivery mechanism ,or bringing a pharmaceutic* 
Lis or tissue which express CG3842 hemcicgous proteins. The an ho* 
mav he generated using methods that are we,, Known in the art. Such 
anybodies may indude. but are not limited to. polyclonal. monodonaU 
clerics, singie chain. Feb fragments, and fragments produce by a « 

«• • ~ ^+\wnA\(*<z i\ e those which inhibit dimer 
expression library. Neutralizing antibodies (i.e.. too 

formation) are especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goa,s rabbits 
1 mi e. humans, and others, may be immunized by in.ection with 
C33842 homologous proteins any fragment or oligopeptide thereof wh^ch 
h as immunogenic properties. Depending on the host species va 

, m =v be used to increase immunological response. Such 

a, minium hydroxide, and surface active substances such as lyso e th 
p,uronic polyols, polyanions. peptides. o„ emulsion, Keyho, Umpe, 
hemocyanin. and dinitrophenol. Among adiuvants used ,n human, BCG 
(B aci„e Calmette-Guerin, and Corynebacterium parvum are espec ally 
e ereble. It is preferred that the peptides, fragments, or oligopeptide 
ed to induce antibodies to CG3842 homologous proteins have an ammo 
a d seance consisting of a. least five amino acids, and more preferably 
east 10 amino acids. I, is preferable that they are identical to a portion 
1 amino acid seguence o, the nature, protein, and they may conta n 
th e entire amino acid seguence of a small, naturally occurring molecule^ 
, Short stretches of CG3842 homologous proteins amino acids may be fused 
with those of another protein such as Keyhole limpet hemocyenin and 
antibody produced against the chimeric molecule. 

Monoclone, antibodies to CG3842 homologous proteins may be prepared 
„ using any technigue which provides for the production . n lbody 
m olecules by continuous cell lines in culture. These include, but are 
rid to. the hybridoma technigue, the human 8-ce„ hybridoma 
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technique, and the EBV-hybridoma technique (Kohler, G. et al. (1975) 
Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods 
81 :31-42; Cote, R. J. et al. Proc. Natl. Acad. Sci. 80:2026-2030; Cole, S. 
P. et al. (1984) Mol. Cell Biol. 62:109-120). 

In addition, techniques developed for the production of "chimeric 
antibodies", the splicing of mouse antibody genes to human antibody 
genes to obtain a molecule with appropriate antigen specificity and 
biological activity can be used (Morrison, S. L. et al. (1984) Proc. Natl. 
Acad. Sci. 81:6851-6855; Neuberger, M. S. et al (1984) Nature 
312:604-608; Takeda, S. et al. (1985) Nature 314:452-454). 
Alternatively, techniques described for the production of single chain 
antibodies may be adapted, using methods known in the art, to produce 
CG3842 homologous proteins -specific single chain antibodies. Antibodies 
with related specificity, but of distinct idiotypic composition, may be 
g enera ted by chain shuffling from random combinatorial immunoglobulin 
libraries (Burton, D. R. (1991) Proc. Natl. Acad. Sci. 88:11120-3). 
Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening recombinant immunoglobulin 
libraries or panels of highly specific binding reagents as disclosed in the 
literature (Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. 86:3833-3837; 
Winter, G. et al. (1991) Nature 349:293-299). 

Antibody fragments, which contain specific binding sites for CG3842 
homologous proteins, may also be generated. For example, such fragments 
include, but are not limited to, the F(ab') 2 fragments which can be 
produced by pepsin digestion of the antibody molecule and the Fab 
fragments which can be generated by reducing the disulfide bridges of 
F(ab') 2 fragments. Alternatively, Fab expression libraries may be 
constructed to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity (Huse, W. D. et al. (1989) Science 
254:1275-1281). 
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Various immunoassays may ba usad for screening to identify antibodies 
having tha desired specificity. Numarous protocols for competitive b.nd.ng 
and immunoradiometric assays using either polyclona, or monoclone 
antibodies with astabiishad specificities ara we., Known in tha art. Such 
immunoassays typica.W invoive tha maasuramant o, comp-ex orm „on 
between CG3842 homologous protains and its spechc antibody. A 
two-site, monoo,„na.-basad immunoassay utiiizing monoolona, ant,bod,as 
r aactive to two non-intarfaring CG3842 homologous protains eprtopes ,s 
preferred, but a competitive binding assay may also ba ampioyad (Maddox. 
supra). 

,n anotbar embodiment of tha invention, the polynuciaotidas encoding 
CG3842 homo,ogous protains. or any fragman, tharaof. or nucle.c ac,d 
affector molecules such as antisansa molaouias. aptamars. RNA, molecu.es 
or ribozymes may ba usad for tharapautic purposas. In one ^aspect 
aptamars. i.e. nucleic aoid molaoulas. which are capable o, b,n ,ng ,0 
CG3842 protain and modulating its activity, may ba ganaratad by a 
screening and selection procadura involving tha usa of combinatory 
nucleic acid libraries. 

, n a furthar aspact. antisense to tha polynuclaotida anooding CG3842 
bamologous protains may ba usad in situations in which „ would ba 
dasirabla to block tha transcription of the mRNA. In particular, calls may ba 
tr ans.ormad with sequences complamantary to polynuciaotidas ancod.ng 
. CG3842 homologous protains. Thus, antisansa molaculas may ba used to 
modulata CG3842 homologous protains activity, or to achiava ragulabon of 
9 ana function. Such tachno.ogy is now wal, know in tha art. and sense or 
ntisansa oligomars or largar fragmants. can ba dasignad from var- 
iations along the coding or control regions of sequences encoding 
, CG3842 homologous protains Expression vectors denved from 
' retroviruses, adenovirus, herpes or vaccinia viruses, or from venous 
bacteria, plasmids may be used for delivery of nucleotide seguancas to tha 
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targeted organ, tissue or ce„popu,ation. Methods, which are we,, known 
tse sld in the art. oan he used to construe, recomhinen, vector, 
wnic h wi„ express antisense moiecufes commentary to 
po , y nuc,eotides o, the gene encoding CG3842 homoiogous proteins. These 
. echnigues are described both in Sambroox et a,, (supra, and in Ausube, e, 
" a (supra,. Genes encoding CG3842 homo.ogous proteins can be turned 
^transforming a cei, or tissue with expression vectors which express 
high ,eve,s of po,ynuc,eo,ide or fragment thereof which encodes CG3842 
h mo-ogous proteins. Such constructs may be used to .ntrodu 
. untrans,a,ab,e sense or antisense seguences into a ce„. Ever , the 
absence o, integration into the DNA. such vectors may 
transcribe RN A mo,ecu,es unti, they are disab,ed by endogenous nuc eases^ 
Transient expression may ,ast .or a month or more with a non-rep , t mg 
v ec,or and even ,onger if appropriate repiication e.ements are part o, the 
15 vector system. 

As mentioned above, modifications of gene expression can be obtained by 
designing antisense mo,ecu,es. DNA. RNA, or nuCeic acid ena.ogues such 
PNA. to the contro, regions of the gene encoding CG3842 homoiogous 
, oteins. ..... .he promoters, enhancers, and introns. 

derived from the transcription inmation site. e.g.. between posrtions -10 

d + 10 from the star, site, are preferred. Simi,ar,y. inhibition can b 
acbieved using "tripie he,ix" base-pairing methodoiogy. Tripie be„x pa.nng 

25 open sufficientiy for the binding of poiymerases, transcription factors, or 
r e gul a,ory moiecuies. Recent therapeutic advances using tripiex DNA have 
Jen described in the Uterature ,Gee. J. E. et a,. (1994, ,n : Huber U 
and 8 ,. Carr. Mo,ecu,ar and ,mmuno,ogic Approaches. Future PubWsh.ng 
Co.. Mt. Kisco. M.Y.). The antisense mo,ecu,es may a,so be designed to 
30 b ,ocK trans,a,ion of mRNA by preventing the transcript from b.nd.ng 
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Rib ozymes. enzymatic RNA mo,ecu,es. may a.so be used to catalyse the 
^ecmc Ceavage of RNA. The mechanism of r.bozyme action .nvoves 
secuence-specifichybridizationCtheribozymemoiecuietocompiementary 

J B et RNA, followed by endonuc.eo.ytic Ceavage. Exemp.es wh,ch may 
b e used, .nc.ude engineered hammerhead mot., r.bozyme mo.ecu.es the 
oen be spec.»,ca..y end efficiency catalyse endonuc,eo,yt.c cleavage of 
fences encoding CG3842 homo.ogous proteins. Specific r.bozyme 
lavage s„es within any potenda, RNA target are .nitiai.y denied by 

follo w.ng seguences: GUA. GUU. and GUC. Once ,den„f,ed sh r RNA 
seguences of between 15 end 20 ribonucleotides correspond^ to the 
e 9 .on of the target gene conta.n.ng the Ceayage site may — a.uated or 
econdary structure, feetures which mey render the o„gonuc.eo,,de 
Zable. The su.,ab,ty o, candidate targets may also be evaluate by 
testing access.bi.ity to hybr.dizat.on with commentary o„gonuc,eot,des 
using ribonuclease protection assays. 

Nuc ,e,c acid effector mo.ecu.es. e.g. antisense * 
th e .nvent.cn may be prepared by any method Known ,n the art for the 
syn thes.s of nuc,e.c acid mo.ecu.es. These .nc.ude fee n,gue 
chemica,.y synthesizing o.igonuc.eotides such as sol.d phase 
"o— chem.ce, synthes.s. A.terne,ve,y. RNA mo.ecu.es may be 
geneLd by in vitro and in v.vo transcr.pt.on of DNA seguences 
generate Such DNA seguences may be incorporated 

CG3842 homologous proteins, bucn uin« m 

, wo a variety of vectors with suitab.e RNA po.ymerase promoters such as 
To S 6. A.fernat.ve.y. these cDN A constructs that synthes.ze anbsense 
RNA cons,„u.ive,y or .nducib.y cen be .ntroduced Into ce„ Unes. ce.,s r 
sues. RNA mo,ecu,es may be mod,ed to .ncrease intracellular stabihty 
and ha,«-,i.e. Possible modifications inc.ude. but are not ,m,ted to the 

, ,h» S' and/or 3' ends of the molecule or 
,„ addition of flanking seguences at the 5 and/or Hio ^ rase 
the use of pbosphorothioate or T O-methy. rather than phosphod.ester e 
Leges wUh.n the backbone of the molecule. Th.s concept ,s rnnerent ,n 
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the production of PNAs and can be extended in e„ of these modules by 
, h e inclusion of non-traditional bases such as inosine. r,ueos,ne and 
wybutosine. as we,, as acety,-. methy,, «hio, and simi.any mod.hed form 
o, adenine, cytidine. guanine, thymine, and uridine which are not as eas.y 
recognized by endogenous endonucleases. 

Many methods for introducing vectors into ce„s or tissues are avai,ab,e and 
suitab,e for use ,n vivo, in vitro, and ex vivo. Po, ex v,vo therapy 
vectors may be introduced into stem ce„s taken from the patient and 
coy Propagated for au.Cogous transient back into that same patient. 
Delivery by transfection and by liposome injections may be achieved us.ng 
' nods, which are we,, known in the art. Any of the therapeutic method 
T scribed above may be appiied to any suitab.e subject ,nc,ud,ng for 

and most preferably, humans. 

An additiona, embodiment of the invention re,ates to the administration of 
Tpharmaceutica, composition, in conjunction with a pharmaceutical 
c p,ab,e carrier, for any c. the therapeutic effects discussed above, 
si pharmaceutic, compositions may consist o, CG3842 homCogous 
proteins, antibodies to CQ3842 homCogous proteins, mimetics. agonrsts 
antagonists or inhibitors o, CG3S42 homologous prote.ns. 

m ons may be adminisfered alone or in combination with at eas 
on e ther agent, such as stabilizing compound, which may e adm,n, e red 
, in any sterile, biocompatible pharmaceutical carrier, rnOud.ng. 

Led to. sa,ine. buffered saline, dextrose, and water. The composrtion 
may be administered ,o a patient a.one, or in combination w, oth 
Tgenta drugs or hormones. The pharmaceutical compositions 

Mention may be administered by any number o, .outes includrn, but 
, limited to oral, intravenous, intramuscular. ,n,ra-arter,al, 
" IS,, -thecal, in,raventricu,a, transdermal, sub—, 
intraperitoneal, intranasal, enteral, topical, sublingual, or recta, means. 
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ln addition to the active ingredients, these pharmaceutics compositions 
may contain suitahie pharmaceuticaUy-acceptable carriers oomp„s,n g 
excipients and auxiliaries, which facilitate process.ng of the actnre 
Ipounds into preparations which, can he used pharmaceutic* Further 
data Is on techniques for formuietion and administration may be found ,n 
IT latest edition of Remington's Pharmaceutical Sciences (Maack 
Publishing Co., Easton, Pa.). 

The pharmaoeutioa, compositions of the present invention may be 
manufactured in a manner that is known in the art, e.g.. by means of 
I"— mixing, dissolving, granuiadng. dragee-making, le.gadng 
emulsifying encapsulating, entrapping, or lyophiliz.ng processes. After 

appropriate container and labeled for treatment of an indicated condrtio . 
PorTdministration o, CG3842 homologous proteins, such ,abe„ng would 
include emoun,. frequency, and method of administrate. 

Pharmaceutical compositions suitable for use in the invention include 
mpositions wherein the active ingredients ere contained ,n an e 

amount to achieve ,he intended purpose. The de,er m ina„on of an 

dl is we,, within the capability of those skilled in the art. For any 
mp unds. the therapeutic* effective dose can be estimated ,n,t,a„y 

compouno Dread ioocyte cell lines, or in an.mal 

either in cell culture assays, e.g.. of pread.pocyt 

m ode,s. usuaUy mice, rabbits, dogs, or pigs. The an.mai mod J o b 
. US ed to determine the appropriate concentration range and route o 
' administration Such information can then be used to determ.ne useful 
I l outes for administration in humans. A therapeutically effective 
dose refers to that amount o, active ingredient, for example CG3842 
nucL acids or proteins or fragments thereof, or an.ibod.es which . 
. rflien, for treating a specific condition. Therapeutic e..,cacy 
homologous proteins and toxicity may be determ.ned by 
pharmaceutic,, procedures in eel, cultures or experimental an.mals. e.g.. 
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ED50 (the dose therapeutically effective in 50% of the population) and 
LD50 (the dose lethal to 50% of the population). The dose ratio between 
therapeutic and toxic effects is the therapeutic index, and it can be 
expressed as the ratio, LD50/ED50. Pharmaceutical compositions, wh.ch 
exhibit large therapeutic indices, are preferred. The data obtained from cell 
culture assays and animal studies is used in formulating a range of dosage 
for human use. The dosage contained in such compositions is preferably 
within a range of circulating concentrations that include the ED50 with 
little or no toxicity. The dosage varies within this range depending upon the 
dosage from employed, sensitivity of the patient, and the route of 
administration. The exact dosage will be determined by the practitioner, ,n 
light of factors related to the subject that requires treatment. Dosage and 
administration are adjusted to provide sufficient levels of the active moiety 
or to maintain the desired effect. Factors, which may be taken ,nto 
account, include the severity of the disease state, general health of the 
subject, age, weight, and gender of the subject, diet, time and frequency 
of administration, drug combination(s), reaction sensitivit.es, and 
tolerance/response to therapy. Long-acting pharmaceutical composes 
m ay be administered every 3 to 4 days, every week, or once every two 
weeks depending on half-life and clearance rate of the particular 
formulation. Norma, dosage amounts may vary from 0.1 to 100,000 
micrograms, up to a total dose of about 1 g, depending upon the route of 
administration. Guidance as to particular dosages and methods of del.very 
is provided in the literature and generally available to practitioners in the 
art Those skilled in the art employ different formulations for nucleotides 
than for proteins or their inhibitors. Similarly, delivery of polynucleotides or 
polypeptides will be specific to particular cells, conditions, locations, etc. 

In another embodiment, antibodies which specifically bind CG3842 
homologous proteins may be used for the diagnosis of conditions or 
diseases characterized by or associated with over- or underexpress.on of 
CG3842, or in assays to monitor patients being treated with CG3842 
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homo.ogous proteins, agonists, antagonists or inhibitors. The antibodies 
useful for diagnostic purposes may be prepared in the same manner as 
those described above for therapeutics. Diagnostic assays for CG3842 
homologous proteins include methods, which utilize the antibody and a 
label to detect CG3842 homologous proteins in human body flu.ds or 
extracts of cells or tissues. The antibodies may be used with or without 
modification, and may be labeled by joining them, either cova.ent.y or 
n on-cova.ent.y, with a reporter mo.ecu.e. A wide variety of reporter 
molecules which are Known in the art may be used severe, of wh.ch are 
described above. 

A variety of protoco.s including ELISA, BIA. and FACS for measuring 
CG3842 homologous proteins are known in the art and provide a bas.s for 
diagnosing altered or abnorma, levels of CG3842 homologous prote.ns 
expression. Norma, or standard values for CQ3842 homologous prote.ns 
expression are established by combining body fluids or cell extracts taken 
from norma, mammalian subjects, preferably human, with antibody to 
0Q3842 homologous proteins under conditions suitable for complex 
formation. The amount of standard complex formation may be guant,f,ed 
bY various methods, but preferably by photometric means. Quantmes of 
CQ3842 homologous proteins expressed in control and disease, samples 
from e.g. biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters 
for diagnosing disease. 

,n another embodiment of the invention, the polynucleotides specific for 
CG3842 homologous proteins may be used for diagnostic purposes. The 
polynucleotides, which may be used, include oligonucleotide sequences, 
antisense RNA and DNA molecules, and PNAs. The polynucleotides may be 
used to detect and guantitate gene expression in biopsied tissues in wh.ch 
expression of CG3842 homologous proteins may be correlated w,th 
disease. The diagnostic assay may be used to distinguish between 
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absence, presence, and excess expression of CG3842 homdogous 
proteins, and ,0 monitor regulation of CG3842 homologous prote.ns levels 
during therapeutic intervention. 

,„ one aspect, hybridization with PGR probes which are capable of 
detecting polynucleotide sequences, induding genomic sequences, 
ncodin CG3842 homdogous proteins Cose. re,ated 
us ed to identify nucleic acid sequences which encode CG3842 
homologous proteins. The specificity o, the probe, whether i, is made from 
72 h, specific region, e.g.. unique nuclides in the 5' regu.atory 
egion. or a less specific region, e.g.. especially in the 3- codrng .egron. 
and the stringency o, ,he hybridization or amplification .mexrmal. h.gh, 
ntermediate. or low, w„, determine whether the probe 
natural* occurring sequences encoding CG3842 homdogous prote.ns 
1. or related sequences. Probes may also be used for the detectron o 
re ,ated sequences, and should preferably conta.n at feast 80 A 
nucleotides from any of the CG3842 homologous prote.ns encod.ng 
sequences. The hybridization probes of the subiec, invent.cn may be DNA 
or RNA and derived from the nucleotide sequence o, the po,ynudeot,de 
comprising the nucleic acid sequence of nucleic acids encod.ng a 
Drosophi, protein (GadF.y Accession Number CG3842, or human 
hlmCogous proteins, or from a genomic sequence induding prom e 
enhancer elements, and introns of the naturally occurr.n, CG3842 
homdogous proteins. Means for producing specific hybridizat.cn probes fo 
DNAs encoding CG 3842 homdogous proteins include the . , g 
nueWe acid sequences encoding CG3842 homologous prote.ns der.vat.ves 
int0 vectors for the production o, mRNA probes. Such vectors are Known 
in the art. commercially available, and may be used ,0 synthes.ze RNA 
probes in vitro by means o, ,he addition of the approbate RNA 
. polymerases and the appropriate labeled nucleotides. Hybnd.za on pr be 
mav be labeled by a variety o, reporter groups, for example, rad.onud.des 
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such as »P or *S. or enzymatic labels, such as alkaline phosphatase 
coupled to the probe via avidin/biotin coupling systems, and the l,ke. 

Polynucleotide sequences encoding CG3842 homologous proteins may be 
used for the diagnosis of conditions or diseases, which are associated w,th 
expression of CG3842 homologous proteins. Examples of such conditions 
or diseases include, but are not limited to. metabolic diseases and 
disorders, induding obesity and diabetes. Polynucleotide sequences 
specific for CG3842 homologous proteins may also be used to monitor the 
plgress o, patients receiving treatment for metabolic diseases and 
disorders, induding obesity and diabetes. The polynucleotide sequences 
specific for CG3842 homologous proteins may be used in Southern or 
Northern anaiysis. do, blot, or other membrane-based technologies; inPCR 
technologies: or in dip stick, pin. ELISA or chip assays uttang fluids or 
Lues from patient biopsies to detect altered CG3842 homologous 
proteins expression. Such qualitative or quantitative methods are well 
known in the art. 

,n a particular aspect, the nucleotide sequences specific for CG3842 
oomolog proteins may be useful in assays the, detect activation or 
induction of various metaboiic diseases such as obesity as we,, as related 
disorders such as metabolic syndrome, eating disorder, cachexia, d.abetes 
m e»i,us, hypertension, coronary hear, disease, hypercholesterolemia, 
dyslipidemia, os,eoar,hri,is. and/o, gallstones. The nudeotide sequences 
encoding CG3842 homolog proteins may be labeled by standard methods, 
and added to a fluid or tissue sample from a patient under conditions 
suitable for the formation of hybridization comp.exes. After a su,,ab,e 
incubation period, the sample is washed and the signa, is quantised and 
compared with a standard value. The presence of altered levels o 
„ nucleotide sequences encoding CG3842 homolog proteins ,n the sample 
indicates the presence of the associated disease. Such assays may also 
used to evaluate the efficacy o, a particuiar therapeutic treatment reg.men 
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in animal studies, in clinical trials, or in monitoring the treatment of an 
individual patient. 

I„ order to provide a basis for the diagnosis of disease associated with 
expression of CQ3842 homolog proteins, a normal or standard profile for 
expression is established. This may be accomplished by combining body 
fluids or cell extracts taken from norma, subjects, either animal or human, 
with a sequence which encodes CG3842 homolog proteins or a fragment 
thereof under conditions suitable for hybridization or amp.ification. 
Standard hybridization may be quantified by comparing the values obtained 
fr0 m normal subjects with those from an experiment where a Known 
amount of a substantially purified polynucleotide is used. Standard values 
obtained from norma, samples may be compared with values obtained from 
samples from patients who ere symptomatic .or disease. Dev.ation 
between standard and subject va.ues is used ,o establish the presence of 
disease Once disease is established and a treatment protocol is imtiated. 
„ yb ,idization assays may be repeated on a regular basis to evaluate 
whether the level of expression in the patien, begins ,o approbate ha,, 
which is observed in the norma, patient. The results obtained from 
successive assays may be used ,0 show ,he efficacy of treatment over a 
period ranging from several days to months. 

With respect to me,abo,ic diseases (metabolic syndrome,, the presence of 
a relatively high amount of transcript in biopsied tissue from an indiv.dua, 
may indicate a predisposition for the development of the disease, or may 
pr0 vide a means for detecting ,he disease prior ,o the appearance of 
clinical symptoms. A more definitive diagnosis of this type may allow 
health professionals to employ preventative measures or aggressive 
treatment eariier thereby preventing the development or further progression 
, of the metabolic diseases and disorders. Additional diagnostic uses or 
oligonucleotides designed from the sequences encoding CG3842 homolog 
pro ,eins may involve the use of PCR. Such oligomers may be chem.cally 
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synthesized, generated enzymatically, or produced from a recombinant 
source Oligomers will preferably consist of two nucleotide sequences, one 
with sense orientation and another with antisense. employed under 
optimized conditions for identification 'of a specific gene or condition. The 
same two oligomers, nested sets of oligomers, or even a degenerate poo, 
o, oligomers may be employed under less stringent conditions for detecon 
and/or quantification of closely related DNA or RNA sequences. 

Methods which may also be used to quantity the expression of CG3842 
, homolog proteins include radiolabeling or biotiny,ating nudeobdes 

coamplification of a contro, nuCeic acid, and standard curves ontc , w ,,ch 

the experimental results are interpolated (Melby. P. C. et a.. (19931 J. 

.mmunol. Methods. 1 59:235-244; Duplaa. C. e, a,. ,1993. Ana,. B.ochem. 

212-229-236). The speed of quantification of multiple samples may be 
. accelerated by running the assay in an ELISA forma, where the oligomer of 

interest is presented in various dilutions and a spectrophotometry or 

colorimetric response gives rapid quantification. 

,n another embodiment of the invention, the nucleic acid sequences, which 
2q encode CG3842 homolog proteins, may also be used to generate 
hybridization probes, which are usefu, for mapping the naturally occurring 
9 enomic sequence. The sequences may be mapped to a particular 
chromosome or to a specific region of the chromosome using well known 
techniques. Such techniques include FISH. FACS. or artificial chromosome 
25 constructions, such as yeast artificial chromosomes, bacteria, art,f,c,a, 
chromosomes, bacteria, PI constructions or single chromosome cDNA 
, ib ,aries as reviewed in Price, C. M. (1993) Blood Rev. 7:127-134. and 
Trask B. J. (1991) Trends Genet. 7:149-154. FISH (as described in Verma 
et a,' (1988) Human Chromosomes: A Manual of Basic Techn.ques. 
30 Pergamon Press. New York. N.Y., may be correlated with other physical 
chromosome mapping techniques and genetic map data. Examples of 
genetic map data can be found in the 1994 Genome Issue of Scence 
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(265:1 981 f). Correlation between the location of the gene encoding 
CG3842 oh a physical chromosomal map and a specific disease, or 
predisposition to a specific disease, may help to delimit the region of DNA 
associated with that genetic disease. 

The nucleotide sequences of the CG3842 homolog proteins may be used 
to detect differences in gene sequences between normal, carrier, or 
affected individuals. In situ hybridization of chromosomal preparations and 
physical mapping techniques such as linkage analysis using established 
chromosomal markers may be used for extending genetic maps. Often the 
placement of a gene on the chromosome of another mammalian species, 
such as mouse, may reveal associated markers even if the number or arm 
of a particular human chromosome is not known. New sequences can be 
assigned to chromosomal arms, or parts thereof, by physical mapping. This 
provides valuable information to investigators searching for disease genes 
using positional cloning or other gene discovery techniques. Once the 
disease or syndrome has been crudely localized by genetic linkage to a 
particular genomic region, for example, AT to 1 1 q22-23 (Gatti, R. A. et al. 
(1988) Nature 336:577-580), any sequences mapping to that area may 
represent associated or regulatory genes for further investigation. The 
nucleotide sequences of the subject invention may also be used to detect 
differences in the chromosomal location due to translocation, inversion, 
etc. among normal, carrier, or affected individuals. 

In another embodiment of the invention, the CG3842 homolog proteins, 
their catalytic or immunogenic fragments or oligopeptides thereof, an in 
vitro model, a genetically altered cell, or animal, can be used for screening 
libraries of compounds in any of a variety of drug screening techniques. 

One can identify effectors, e.g. receptors, enzymes, proteins, ligands, or 
substrates that bind to, modulate or mimic the action of one or more of the 
proteins of the invention. The protein or a fragment thereof employed in 
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such screening may be free in solution, affixed to a solid support, borne on 
a cell surface, or located intracellular^. The formation of binding 
complexes, between CG3842 homolog proteins and the agent tested, may 
be measured. Agents could, either directly or indirectly, influence the 

5 activity of the proteins of the invention. Mechanisms of direct influence 
could for example, but not exclusively, be the interference of agents with 
either substrate or cofactor recognition, binding, and conversion. For 
example the dehydrogenase activity could be measured in vitro by using 
recombinantly expressed and purified CG3842 homolog proteins or 

io fragments thereof. 

For example, an in vitro measurement of the oxidoreductase activity of the 
proteins of the invention can be performed by using tritiated steroids, as 
for example but not limited to dihydrotestosterone, progesterone, 
15 corticosterone, aldosterone, androsterone, allopregnanolone, or 
3-androstanediol. The reaction can be driven in the oxidative direction in 
the presence of the cofactors NADP + /NAD + or in the reductive direction in 
the presence of NADPH/NADH. The reaction products can be extracted 
and analysed on silica gel TLC plates. Alternatively, retinoid substrates, i.e. 
20 all-trans and cis-isomers of retinoids, can be used as oxidoreductase 
substrates and product generation can be analysed by procedures known 
in the art, such as HPLC separation (Kedishvili et al. (2002), J. Biol. Chem. 
277 (No. 32), 28909-28915). In another embodiment of the invention, the 
oxidoreductase activity of the enzymes can be monitored following the 
25 change in fluorescence by conversion of NADPH or NADH to or from 
NADP + or NAD + , respectively. 

In addition, activity of CG3842 homolog proteins against its physiological 
substrate(s) or derivatives thereof can be measured in cell-based assays. 
30 Agents may also interfere with posttranslational modifications of the 
protein, such as phosphorylation and dephosphorylation, farnesylation, 
palmitoylation, acetylation. alkylation, ubiquitination, proteolytic 
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processing, subcellular localization, and degradation. Moreover, agents can 
influence the dimerization or o.igomerization of the proteins of the invent.on 
or in a heterologous manner, of the proteins of the invention with other 
proteins for example, but not exclusively, docking proteins, enzymes, 
receptors, or translation factors. Agents can also act on the physical 
interaction of the proteins of this invention with other proteins, wh.ch are 
required for protein function, for example, but not exclusively, the.r 
downstream signaling. 

Methods for determining protein-protein interaction are well known in the 
art For example, binding of a fluorescent.y labeled peptide derived from 
CG3842 homolog protein interacting protein (or vice versa), can be 
detected by a change in polarisation. In case that both binding partners, 
which can be either the full length proteins as well fragments thereof are 
f, U orescent«y labeled, binding can be detected by fluorescence energy 
transfer (FRET) from one f.uorophore to the other. In addition, a variety of 
commercially availab.e assay principles suitable for detection of 
protein-protein interaction are well known in the art, for example but not 
exclusively AlphaScreen (Perkin Elmer) or Scintillation Proximity Assays 
(SPA) by Amersham. Alternatively, the interaction of CG3842 homolog 
proteins with cellular proteins can be the basis for a cell-based scr.en.ng 
assay in which both proteins are fluorescent.y labeled and interacts of 
both proteins is detected by analyzing cotrans.ocation of both proteins with 
a cellular imaging reader, as developed for example, but not exclusively, by 
Genomics or EvotecOAI. In all cases, the two or more binding partners can 
be different proteins with one being a CG3842 homolog protein, or ,n case 
of dimerization and/or o.igomerization the CG3842 homolog protein .tse.f . 
CG3842 homolog proteins, for which one target mechanism of mterest, 
but not the only one, would be such protein/protein interactions are 
unnamed human protein XP_085058 and human CGI-82. 
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Of particular interest are screening assays for agents that have a low 
toxicity for mammalian cells. The term "agent" as used herein describes 
any molecule, e.g. protein or pharmaceutical, with the capability of altering 
or mimicking the physiological function of one or more of the proteins of 
the invention. Candidate agents encompass numerous chemical classes, 
though typically they are organic molecules, preferably small organic 
compounds having a molecular weight of more than 50 and less than 
about 2,500 Daltons. Candidate agents comprise functional groups 
necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or 
carboxyl group, preferably at least two of the functional chemical groups. 
The candidate agents often comprise carbocyclic or heterocyclic structures 
and/or aromatic or polyaromatic structures substituted with one or more of 
the above functional groups. 

Candidate agents are also found among biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, nucleic acids and 
derivatives, structural analogs or combinations thereof. Candidate agents 
are obtained from a wide variety of sources including libraries of synthetic 
or natural compounds. For example, numerous means are available for 
random and directed synthesis of a wide variety of organic compounds and 
biomolecules, including expression of randomized oligonucleotides and 
oligopeptides. Alternatively, libraries of natural compounds in the form of 
bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural or synthetically produced libraries and 
compounds are readily modified through conventional chemical, physical 
and biochemical means, and may be used to produce combinatorial 
libraries. Known pharmacological agents may be subjected to directed or 
random chemical modifications, such as acylation, alkylation, esterification, 
amidification, etc. to produce structural analogs. Where the screening 
assay is a binding assay, one or more of the molecules may be joined to a 
label, where the label can directly or indirectly provide a detectable signal. 
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Another technique for drug screening, which may be used, provides for 
high throughput screening of compounds having suitable binding affinity to 
the protein of interest as described in published PCT application 
WO84/03564. In this method, as applied to CG3842 homolog proteins, 
large numbers of different small test compounds are synthesized on a solid 
substrate, such as plastic pins or some other surface. The test compounds 
are reacted with CG3842 homolog proteins, or fragments thereof, and 
washed. Bound CG3842 homolog proteins are then detected by methods 
well known in the art. Purified CG3842 homolog proteins can also be 
coated directly onto plates for use in the aforementioned drug screening 
techniques. Alternatively, non-neutralizing antibodies can be used to 
capture the peptide and immobilize it on a solid support. In another 
embodiment, one may use competitive drug screening assays in which 
neutralizing antibodies capable of binding CG3842 homolog proteins 
specifically compete with a test compound for binding CG3842 homolog 
proteins. In this manner, the antibodies can be used to detect the presence 
of any peptide, which shares one or more antigenic determinants with 
CG3842 homolog proteins. In additional embodiments, the nucleotide 
sequences which encode CG3842 homolog proteins may be used in any 
molecular biology techniques that have yet to be developed, provided the 
new techniques rely on properties of nucleotide that are currently known, 
including, but not limited to, such properties as the triplet genetic code and 
specific base pair interactions. 

The nucleic acids encoding the proteins of the invention can be used to 
generate transgenic cell lines and animals. These transgenic non-human 
animals are useful in the study of the function and regulation of the 
proteins of the invention in vivo. Transgenic animals, particularly 
mammalian transgenic animals, can serve as a model system for the 
investigation of many developmental and cellular processes common to 
humans. A variety of non-human models of metabolic disorders can be 
used to test modulators of the protein of the invention. Misexpression (for 
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example, overexpression or lack of expression) of the protein of the 
invention, particular feeding conditions, and/or administration of 
biologically active compounts can create models of metablic disorders. 

In one embodiment of the invention, such assays use mouse models of 
insulin resistance and/or diabetes, such as mice carrying gene knockouts in 
the leptin pathway (for example, ob (leptin) or db (leptin receptor) mice). 
Such mice develop typical symptoms of diabetes , show hepatic lipid 
accumulation and frequently have increased plasma lipid levels (see 
Bruning et al, 1998, Mol. Cell. 2:449-569). Susceptible wild type mice (for 
example C57BI/6) show similiar symptoms if fed a high fat diet. In addition 
to testing the expression of the proteins of the invention in such mouse 
strains (see EXAMPLES), these mice could be used to test whether 
administration of a candidate modulator alters for example lipid 
accumulation in the liver, in plasma, or adipose tissues using standard 
assays well known in the art, such as FPLC, colorimetric assays, blood 
glucose level tests, insulin tolerance tests and others. 

Transgenic animals may be made through homologous recombination in 
embryonic stem cells, where the normal locus of the gene encoding the 
protein of the invention is mutated. Alternatively, a nucleic acid construct 
encoding the protein is injected into oocytes and is randomly integrated 
into the genome. One may also express the genes of the invention or 
variants thereof in tissues where they are not normally expressed or at 
abnormal times of development. Furthermore, variants of the genes of the 
invention like specific constructs expressing anti-sense molecules or 
expression of dominant negative mutations, which will block or alter the 
expression of the proteins of the invention may be randomly integrated into 
the genome. A detectable marker, such as lac Z or luciferase may be 
introduced into the locus of the genes of the invention, where upregulation 
of expression of the genes of the invention will result in an easily 
detectable change in phenotype. Vectors for stable integration include 
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plasmids, retroviruses and other animal viruses, yeast artificial 
chromosomes (YACs), and the like. DNA constructs for homologous 
recombination will contain at least portions of the genes of the invention 
with the desired genetic modification, and will include regions of homology 
to the target locus. Conveniently, markers for positive and negative 
selection are included. DNA constructs for random integration do not need 
to contain regions of homology to mediate recombination. DNA constructs 
for random integration will consist of the nucleic acids encoding the 
proteins of the invention, a regulatory element (promoter), an intron and a 
poly-adenylation signal. Methods for generating cells having targeted gene 
modifications through homologous recombination are known in the field. 
For embryonic stem (ES) cells, an ES cell line may be employed, or 
embryonic cells may be obtained freshly from a host, e.g. mouse, rat, 
guinea pig, etc. Such cells are grown on an appropriate fibroblast-feeder 
layer and are grown in the presence of leukemia inhibiting factor (LIF). ES 
or embryonic cells may be transfected and can then be used to produce 
transgenic animals. After transfection, the ES cells are plated onto a feeder 
layer in an appropriate medium. Cells containing the construct may be 
selected by employing a selection medium. After sufficient time for 
colonies to grow, they are picked and analyzed for the occurrence of 
homologous recombination. Colonies that are positive may then be used for 
embryo manipulation and morula aggregation. Briefly, morulae are obtained 
from 4 to 6 week old superovulated females, the Zona Pellucida is removed 
and the morulae are put into small depressions of a tissue culture dish. The 
ES cells are trypsinized, and the modified cells are placed into the 
depression closely to the morulae. On the following day the aggregates are 
transfered into the uterine horns of pseudopregnant females. Females are 
then allowed to go to term. Chimeric offsprings can be readily detected by 
a change in coat color and are subsequently screened for the transmission 
of the mutation into the next generation (F1 -generation). Offspring of the 
F1 -generation are screened for the presence of the modified gene and 
males and females having the modification are mated to produce 
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homozygous progeny. If the gene alterations cause lethality at some point 
in development, tissues or organs can be maintained as allogenic or 
congenic grafts or transplants, or in vitro culture. The transgenic animals 
may be any non-human mammal, such as laboratory animal, domestic 
animals, etc., for example, mouse, rat. guinea pig, sheep, cow, pig, and 
others. The transgenic animals may be used in functional studies, drug 
screening, and other applications and are useful in the study of the 
function and regulation of the proteins of the invention in vivo. 

Finally, the invention also relates to a kit comprising at least one of 
a SCAD nucleic acid molecule or a fragment thereof; 
a vector comprising the nucleic acid of (a); 

a host cell comprising the nucleic acid of (a) or the vector of (b); 
a polypeptide encoded by the nucleic acid of (a); 

(e) a fusion polypeptide encoded by the nucleic acid of (a); 

(f) an antibody, an aptamer or another receptor against the nucleic acid 
of (a) or the polypeptide of (d) or (e) and 

(g) an anti-sense oligonucleotide of the nucleic acid of (a). 

The kit may be used for diagnostic or therapeutic purposes or for screening 
applications as described above. The kit may further contain user 
instructions. 



(a) 
(b) 
(c) 
(d) 



The FIGURES show: 



) 



FIGURE 1 shows the increase of triglyceride content of PX2287.1 flies 
(referred to as "2287.1") caused by integration of the P-vector (in 
comparison to controls with integration of these vectors elsewhere in 
genome, referred to as "TG010419, n = 60"). 
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FIGURE 2 shows the molecular organization of the mutated gene locus of 
Gadfly Accession Number CG3842. 

FIGURE 3 shows the BLASTP search results for Gadfly Accession Number 
CG3842 (Query) with the best human homologous matches (Sbject). 
FIGURE 3A shows the homology to human unnamed protein with GenBank 
Accession Number XP_085058.1 (SEQ ID NO: 2) 

FIGURE 3B shows the homology to human PAN 2 protein (GenBank 
Accession Number NP_065956.1 ). 

FIGURE 3C shows the homology to human CGI-82 protein (GenBank 
Accession Number NP_057110.1, SEQ ID NO: 4). 

FIGURE 4 shows the Clustal X (1 .81 ) multiple sequence alignment analysis 
containing protein sequences for human CGI-82 (Accession Number 
NP_057110, SEQ ID NO: 4), human XPJD85058 (SEQ ID NO: 2), 
Drosophila GadFly Accession Number CG3842, and human PAN 2 
(Accession Number NP_065956) 

FIGURE 5 shows the human sequences of the invention 
FIGURE 5A. shows the nucleic acid sequence of human unnamed protein 
XP_085058 (also referred to herein as DG21-1) (SEQ ID NO:1; GenBank 
Accession Number XMJ)85058). 

FIGURE 5B. shows the amino acid sequence of human unnamed protein 
XP 085058 (SEQ ID NO:2; GenBank Accession Number XP085058). 
FIGURE 5C. shows the nucleic acid sequence of human CGI-82 (also 
referred to herein as DG21 -2 or PSDR1 ) (SEQ ID NO:3; GenBank Accession 
Number NMJ)16026). 

FIGURE 5D. shows the amino acid sequence of human CGI-82 (also 
referred to herein as DG21-2 or PSDR1) (SEQ ID NO:4; GenBank 
Accession Number NPJD57110). 
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FIGURE 6 Expression of unnamed protein (DG21-1) in mammalian tissues. 
The relative RNA-expression is shown on the X-axis. In FIGURE 6 A, B, and 
C. the tissues tested are given on the X-axis. "WAT" refers to white 
adipose tissue, "BAT" refers to brown adipose tissue. In FIGURES D, E, 
. and F the X-axis represents the time axis. "dO" refers to day 0 (start of 
the experiment). "d2" - "d 10" refers to day 2 - day 10 of adipocyte 
differentiation). 

FIGURE 6A: Real-time PCR analysis of unnamed protein (DG21-1) in 

10 wildtype mouse tissues. 

FIGURE 6B: Real-time PCR mediated analysis of unnamed protein (DG21-1) 

in different mouse models. 

FIGURE 6C: Real-time PCR mediated analysis of unnamed protein (DG21-1) 
in different mouse models (different diets). 
15 FIGURE 6D: Real-time PCR mediated analysis of unnamed protein (DG21-1) 
expression during the differentiation of 3T3-L1 cells from preadipocytes to 
mature adipocytes. 

FIGURE 6E: Real-time PCR mediated analysis of unnamed prote.n 
(DG21-1) expression during the differentiation of 3T3-F442A cells from 
20 preadipocytes to mature adipocytes. 

FIGURE 6F: Real-time PCR mediated analysis of unnamed protein (DG21-1) 
expression during the differentiation of TA1 cells from preadipocytes to 
mature adipocytes. 

25 FIGURE 7. Expression of CGl-82 (DG21-2) in mammalian tissues. 

The relative RNA-expression is shown on the Y-axis. The tissues tested are 
given on the X-axis. "WAT" refers to white adipose tissue, "BAT" refers to 
brown adipose tissue. 
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FIGURE 7A: Real-time PCR analysis of CGl-82 (DG21-2) in wildtype mouse 



tissues. 
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FIGURE 7B: Real-time PCR mediated analysis of CGI-82 (DG21-2) in 
different mouse models. 

FIGURE 7C: Real-time PCR mediated analysis of CGI-82 (DG21-2) in 
different mouse models (different diets). 

FIGURE 8 shows in vitro assays for the determination of glycogen, fatty 
acid metabolism and lipid synthesis in cells overexpressing unnamed 
protein (DG21-1) or CGI-82 (DG21-2) 

FIGURE 8A shows an increase in glycogen levels in cells overexpressing 
unnamed protein (DG21-1) versus control cells. The Y-axis shows cellular 
glycogen levels (//g/mg protein) and the X-axis shows days of cell 
differentiation. 

FIGURE 8B shows no changes in glycogen levels in cells overexpressing 
CGI-82 (DG21-2) versus control cells. The Y-axis shows cellular glycogen 
levels (//g/mg protein) and the X-axis shows days of cell differentiation. 

FIGURE 8C shows an increase in free fatty acid uptake in cells 
overexpressing unnamed protein (DG21 -1 ) or CGI-82 (DG21 -2). The Y-axis 
shows free fatty acid levels (shown as DPM per mg protein). 

FIGURE 8D shows a reduction in free fatty acid esterification in cells 
overexpressing unnamed protein (DG21-1) or CGI-82 (DG21-2). The 
Y-axis shows free fatty acid levels (shown as DPM per mg protein). 

FIGURE 8E shows an increase in lipid synthesis in cells overexpressing 
unnamed protein (DG21-1) or CGI-82 (DG21-2) with or without insulin 
stimulation. The Y-axis shows lipid synthesis (shown as DPM per mg 
protein). 
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The examples illustrate the invention: 

Example 1 : Measurement of energy storage metabolites content 

Mutant flies are obtained from a proprietary fly mutation stock collection. 
The flies are grown under standard conditions known to those skilled in the 
art. In the course of the experiment, additional feedings with bakers yeast 
(Saccharomyces cerevisiae) are provided. The average increase of 
triglyceride content of Drosophila flies containing the transposon vector in 
the homozygous viable PX2287.1 integration was investigated in 
comparison to control flies (FIGURE 1). For determination of triglyceride 
content, flies were incubated for 5 min at 70°C in an aqueous buffer using 
a waterbath, followed by hot extraction. After another 5 min incubation at 
70°C and mild centrifugation, the triglyceride content of the flies extract 
was determined using Sigma Triglyceride (INT 336-10 or -20) assay by 
measuring changes in the optical density according to the manufacturer's 
protocol. As a reference protein content of the same extract was measured 
using BIO-RAD DC Protein Assay according to the manufacturer's protocol. 
The assay was repeated several times. The average triglyceride level of all 
flies of the PX collection is shown as 100% in FIGURE 1. PX2287.1 
homozygous flies show constantly a higher triglyceride content than the 
controls The average increase of triglyceride content of the homozygous 
viable Drosophila line PX2287.1 is 80% (column 2 in FIGURE 1). 
Therefore, the change of gene activity in the locus of the PX2287.1 
integration on chromosome X where the EP-vector of PX2287.1 flies is 
homozygous viable integrated, is responsible for changes in the metabolism 
of the energy storage triglycerides. 

Example 2: Identification of a Drosophila gene associated with metabolic 
control 
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In FIGURE 2, genomic DNA is represented by the assembly as a thin black 
line in the middle (numbers represent the length in basepairs of the 
genomic DNA) that includes the integration sites of vector for line 
PX2287.1. Transcribed DNA sequences (ESTs) and predicted exons are 
shown as bars on the two sides (sense and antisense strand). Predicted 
exons of the cDNA are shown as dark grey bars and introns as light grey 
lines. The sequence encodes for a gene that is predicted by GadFly 
sequence analysis programs as Accession Number CG3842. Using those 
isolated genomic sequences public databases like Berkeley Drosophila 
Genome Project (GadFly) were screened confirming the homozygous viable 
integration site of the PX2287.1 vector 542 basepairs downstream of the 
coding sequence of CG3842, causing an increase of triglyceride content 
(the site of integration is shown as vertical dotted line). Therefore, 
expression of the cDNA encoding Accession Number CG3842 could be 
effected by homozygous viable integration of vectors of line PX2287.1, 
leading to an increase of the energy storage triglycerides. 

Example 3: Identification of human CG3842 homologues 

CG3842 homologous proteins and nucleic acid molecules coding therefore 
are obtainable from insect or vertebrate species, e.g. mammals or birds. 
The most similar human nucleic acid sequences and the proteins encoded 
thereby have been determined using the BLAST algorithm searching public. 
GenBank databases (see FIGURE 3). The most homologous human proteins 
are unnamed protein with GenBank Accession Number XP085058 (64% 
homology; see FIGURE 3A), human CGI-82 protein (GenBank Accession 
Number NM_01 6026; 62% homology; see FIGURE 3C), and PAN2 
(GenBank Accession Number NMJ320905; 59% homology; see FIGURE 
3B). 
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The results of a comparison of the Drosophila adh-short domains (e.g. 
GadFly Accession Number CG3842, amino acids 73 to 328) with the 
adh-short domains unnamed protein with GenBank Accession Number 
XP 085058, of human CGI-82 protein (GenBank Accession Number 
NMJD1 6026), and of human PAN 2 protein (GenBank Accession Number 
NMJD20905) in a pairwise alignment are shown in TABLE 1 . 

TABLE 1 . Results of pairwise alignment of deduced amino acid sequences 
of Drosophila GadFly Accession Numbers CG3842 and closely related 
human proteins unnamed protein XP_085058, CGI-82 (PSDR1 ), and PAN 2. 



Human protein 


Drosophila protein 


Identities/Similarities 


XP_085058 


CG3842 


56%/67% 


CGI-82 


CG3842 


55%/66% 


PAN2 


CG3842 


52%/63% 



A ClustaW (1 .81) multiple sequence alignment has been conducted among 
the adh-short domains of the proteins described in TABLE 1 above and is 
shown in FIGURE 4. 

Example 4: Expression of the polypeptides in mammalian tissues 

To analyse the expression of the rnRNA coding for the proteins of the 
invention in mammalian tissues, several mouse strains (preferably mice 
strains C57BI/6J, C57B1/6 ob/ob and C57BI/KS db/db which are standard 
model systems in obesity and diabetes research) were purchased from 
Harlan Winkelmann (33178 Borchen, Germany) and maintained under 
constant temperature (preferably 22°C), 40 per cent humidity and a light 
/ dark cycle of preferably 14/10 hours. The mice were fed a standard 
chow (for example, from ssniff Spezialitaten GmbH, order number ssniff 
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M-Z V1 1 26-000). For the fasting experiment, wild type mice were starved 
for 48 h without food, but only water supplied ad libitum, (see, for 
example, Schnetzler et al. J Clin Invest 1993 Jul;92(l):272-80, Mizuno et 
al. Proc Natl Acad Sci U S A 1996 Apr 1 6;93(8):3434-8). Animals were 
sacrificed at an age of 6 to 8 weeks. The animal tissues were isolated 
according to standard procedures known to those skilled in the art, snap 
frozen in liquid nitrogen and stored at -80°C until needed. 

To analyse the role of the proteins disclosed in this invention in the in vitro 
differentiation of different mammalian cell culture cells from pre-adipocytes 
to adipocytes, mammalian fibroblast (3T3-L1) cells (e.g., Green & Kehinde, 
Cell 1: 113-116, 1974) were obtained from the American Tissue Culture 
Collection (ATCC, Hanassas, VA, USA; ATCC- CL 173). Alternatively, a 
mammalian fibroblast TA1 cell line, a murine preadipocyte line derived from 
T101/2 mouse embryo fibroblasts (Chapman et al., 1984, J Biol Chem 
259(24) :1 5548-55) or a 3T3 derived celline called 3T3-F442A was used. 
3T3-L1 or TA-1 cells were maintained as fibroblasts and differentiated into 
adipocytes as described in the prior art (e.g., Qiu. et al., J. Biol. Chem. 
276:11988-95, 2001; Slieker et al., BBRC 251: 225-9, 1998). In brief, 
cells were plated in DMEM/10% FCS (Invitrogen, Karlsruhe, Germany) at 
50,000 cells/well in duplicates in 6-well plastic dishes and cultured in a 
humidified atmosphere of 5% C0 2 at 37°C. At confluence (defined as day 
0: dO) cells were transferred to serum-free (SF) medium, containing 
DMEM/HamF1 2 (3:1; Invitrogen), Fetuin (300microg/ml; Sigma, Munich, 
Germany), Transferrin (2microg/ml; Sigma), pantothenate (17microM; 
Sigma), biotin (ImicroM; Sigma), and EGF (0.8 nM; Hoffmann-La Roche, 
Basel, Switzerland). Differentiation was induced by adding Dexamethasone 
(DEX; 0.1 /jM; Sigma), 3-methyl-isobutyl-1 -methylxanthine (MIX; 0.5 mM; 
Sigma), and bovine insulin (5 //g/ml; Invitrogen) for four days. Four days 
after confluence (d4), cells were kept in SF medium until differentiation 
was completed at d12. The medium was changed every 48 hours and 
suitable aliquots of cells were taken. 
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TaqMan Analysis of the RNA expression 

RNA was isolated from mouse tissues or cell culture cells using Trizol 
Reagent (for example, from Invitrogen, Karlsruhe, Germany) and further 
purified with the RNeasy Kit (for example, from Qiagen, Germany) in 
combination with an DNase-treatment according to the instructions of the 
manufacturers and as known to those skilled in the art. Total RNA was 
reverse transcribed (preferably using Superscript II RNaseH- Reverse 
Transcriptase, from Invitrogen, Karlsruhe, Germany) and subjected to 
Taqman analysis preferably using the Taqman 2xPCR Master Mix (from 
Applied Biosystems, Weiterstadt, Germany; the Mix contains according to 
the Manufacturer for example AmpliTaq Gold DNA Polymerase, AmpErase 
UNG, dNTPs with dUTP, passive reference Rox and optimized buffer 
components) on a GeneAmp 5700 Sequence Detection System (from 
Applied Biosystems, Weiterstadt, Germany). 

The following primer/probe pairs were used for the TaqMan analysis 
(GenBank Accession Number AK020927 for mouse DG-21-1 sequence 
homolog to human unnamed protein XP_085O58; GenBank Accession 
Number AB030503 for mouse DG-21-2 sequence, homolog to human 
protein CGI-82): 

Mouse DG-21-1 forward primer (SEQ ID NO: 5): 5'- 
CAAGGGCCCGGAACAAG-3'; 

Mouse DG-21-1 reverse primer (SEQ ID NO: 6): 5'- 
TGGATTCCTAGAAGCTCGCAG-3'; 

Mouse DG-21-1 Taqman probe (5/6-FAM) (SEQ ID NO: 7): 

AACAGCTGAGCGATTGTGGAACGTCA (5/6-TAMRA) 

Mouse DG-21-2 forward primer (SEQ ID NO: 8): 5'- 

AATCCAGGACTACATTTCCCAGAG-3'; 

Mouse DG-21-2 reverse primer (SEQ ID NO: 9): 5'- 
CCTAGACACACCCCGGTTCA-3'; 
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Mouse DG-21-2 Taqman probe (5/6-FAM) (SEQ ID NO: 10): 
CTTTGCTCCCTGGCGGTTGCAG (5/6-TAMRA) 

The expression profiling studies confirm the particular relevance of the 
proteins of the invention as regulators of energy metabolism in mammalian 
cells. Taqman analysis revealed that the proteins of the invention are 
ubiquitously expressed in different types of mamalian tissues including 
metabolic active tissues such as white (WAT) and brown (BAT) adipose 
tissue. However, DG21-1 mRNA is predominantly expressed in spleen, 
bone marrow, lung, WAT and muscle tissues compared to other tissue 
types in wild type mouse as depicted in FIGURE 6A. DG21-2, although also 
ubiqutously expressed, shows an highest expression in testis (FIGURE 7A). 

Further, we show in this invention that the transcription of the genes 
coding for the proteins of the invention is regulated by fasting, high fat 
diet (HFD), or by genetically induced obesity. In this invention, we used 
mouse models of insulin resistance and/or diabetes, such as mice carrying 
gene knockouts in the leptin pathway (for example, ob (leptin) or db (leptin 
receptor/ligand) mice) to study the expression of the protein of the 
invention. Such mice develop typical symptoms of diabetes, show hepatic 
lipid accumulation and frequently have increased plasma lipid levels (see 
Bruning et al, 1998, Mol. Cell. 2:449-569). 

We found in this invention that DG21-1 (unnamed protein) mRNA is 
upregulated in pancreas and metabolic active tissue brown adipose tissue 
(BAT) of fasted and in BAT of genetically induced obese mice (ob-ob) 
compared to wildtype (wt) mice (FIGURE 6B) and also in pancreas and BAT 
of susceptible wild type mice (for example C57B1/6) that show symptoms 
of diabetes, lipid accumulation, and high plasma lipid levels, if fed a high 
fat diet (HFD) (FIGURE 6C). DG-21-2 (homolog to CGI-82 = PSDR1) is 
strongly up-regulated in white adipose tissue (WAT) of ob/ob mice 
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compared to wt mice (FIGURE 7B) and also in BAT and muscle of mice fed 
a HFD compared to mice fed a standard diet (FIGURE 7C). 

Unnamed protein XP_085058 (DG21-1) is significantly up regulated (35 
fold up regulation on day 6) during differentiation of 3T3-L1 cells (FIGURE 
6E). as well as in 3T3-F442A cells (60 fold upregulation on day 4) 
(FIGURE 6D) and TA1 cells (30 fold upregulation on day 12) cells (see 
FIGURE 6F). 

The tissue specific regulation of unnamed protein XP_085058 (DG-21-1) 
and CGI-82 (DG-21-2) and the regulation of DG-21-1 during in vitro 
adipocyte differentiation shows clearly that the proteins of the invention 
are modulatorsof adipogenesis and metabolism in mammalian cells. 



Example 5: In vitro assays in cells overexpressing the proteins of the 
invention 

Obesity is caused by an inbalance of energy stored and energy used by the 
regarding organism. In mammalian cells energy is stored by energy storage 
metabolites (ESM), mainly as glycogen or triglycerides. Glycogen is used as 
a quick response to urgent energy needs whereas triglycerides are used as 
fuel for long term energy expenditure. In this invention, we therefore 
analyzed the level of energy storage metabolites in cells overexpressing 
the proteins of the invention. 

Retroviral infection of preadipocytes 

Packaging cells were transfected with retroviral plasmids pLPCX carrying 
the mouse genes coding for the proteins of the invention and a selection 
marker using calcium phosphate procedure. Control cells were infected 
with pLPCX carrying no transgene. Briefly, exponentially growing 
packaging cells were seeded at a density of 350,000 cells per 6-well in 2 
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ml DMEM + 10 % FCS one day before transfection. 10 min before 
transfection chloroquine was added directly to the overlying medium (25 
microM end concentration). A 250 //I transfection mix consisting of 5 fjg 
plasmid-DNA (candidate:helper-virus in a 1 :1 ratio) and 250 mM CaCI 2 was 
prepared in a 1 5 ml plastic tube. The same volume of 2 x HBS (280 jjM 
NaCI # 50 //M HEPES, 1.5 mM Na 2 HP0 4 , pH 7.06) was added and air 
bubbles were injected into the mixture for 1 5 sec. The transfection mix 
was added drop wise to the packaging cells, distributed and the cells were 
incubated at 37 °C, 5 % CO z for 6 hours. The cells were washed with PBS 
and the medium was exchanged with 2 ml DMEM + 10 % CS per 6-weII. 
One day after transfection the cells were washed again and incubated for 
2 days of virus collection in 1 ml DMEM + 10 % CS per 6-well at 32°C / 
5 % C0 2 . The supernatant was then filtered through a 0.45 jjrn cellulose 
acetate filter and polybrene (end concentration 8 /yg/ml) was added. 
Mammalian fibroblast (3T3-L1) cells in a sub-confluent state were overlaid 
with the prepared virus containing medium. The infected cells were 
selected for 1 week with 2 //g/ml puromycin. Following selection the cells 
were checked for transgene expression by western blot and 
immunofluorescence. Overexpressing cells were seeded for differentiation. 
3T3-L1 cells were maintained as fibroblasts and differentiated into 
adipocytes as described in the prior art and above. For analysing the role of 
the proteins disclosed in this invention in vitro assays for the determination 
of ESM storage, synthesis and transport were performed. 

Preparation of cell lysates for analysis of energy storage metabolites 
Starting at confluence (d0), cell media was changed every 48 hours. Cells 
were harvested 6 hours after media change as follows. Media was 
collected, and cells were washed twice in PBS prior to lyses in 600 //I 
HB-buffer (0.5% polyoxyethylene 10 tridecylethan, 1 mM EDTA, 0.01 M 
NaH 2 P0 4 , pH 7.4). After inactivation at 70°C for 5 minutes, cell lysates 
were prepared on Bio 101 systems lysing matrix B (0.1 mm silica beads; 
Q-Biogene, Carlsbad, USA) by agitation for 2 x 45 seconds at a speed of 
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4.5 (Fastprep FP120, Bio 101 Thermosavant, Holbrock, USA). 
Supernatants of lysed ceils were collected after centrifugation at 3000 rpm 
for 2 minutes, and stored in aliquots for later analysis at -80 °C. 

Changes in cellular glycogen levels during adipogenesis 
Cell lysates were analysed in duplicates in 96-well plates for total protein 
and glycogen content using the Bio-Rad DC Protein assay reagent (Bio-Rad, 
Munich, Germany) according to the manufacturer's instructions and an 
enzymatic starch kit from Hoffmann-La Roche {Basel, Switzerland). 10-//L 
samples were incubated with 20-//I amyloglucosidase solution for 15 
minutes at 60 °C to digest glycogen to glucose. The glucose is further 
metabolised with 100 fj\ distilled water and 100 /j\ of enzyme cofactor 
buffer and 12 jj\ of enzyme buffer (hexokinase and glucose phosphate 
dehydrogenase). Background glucose levels are determined by subtracting 
values from a duplicate plate without the amyloglucosidase. Final 
absorbance is determined at 340 nm. HB-buffer as blank, and a standard 
curve of glycogen (Hoffmann-La Roche) were included in each assay. 
Glycogen content in samples were calculated using a standard curve. 

Glucose is taken up by the cells rapidly and stored in the form of glycogen. 
This energy storage is than primarily used for the metabolic demands of 
the cell. The Cellular glycogen levels were increased throughout 
adipogenesis {beginning on d6 with a maximum on d8 lasting up to d1 2) in 
differentiated adipocytes overexpressing unnamed protein XP 085058 , as 
shown in FIGURE 8A. This increase in glycogen level as a consequence of 
over expression of unnamed protein XP_085058 could reflect an elevated 
glucose uptake, a higher glycogen synthesis rate or a decreased energy 
consumption and thus confirms a role of unnamed protein XP_085058 in 
metabolic regulation. 
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Synthesis of lipids during adipogenesis 

During the terminal stage of adipogenesis (d12) cells were analysed for 
their ability to metabolise lipids. A modified protocol to the method of 
Jensen et al. (2000) for lipid synthesis was established. Cells were washed 
3 times with PBS prior to serum starvation in Krebs-Ringer-Bicarbonate- 
Hepes buffer (KRBH; 134 nM NaCI, 3.5 mM KCI, 1.2 mM KH 2 P0 4 , 0.5 
mM MgSO*, 1.5 mM CaCI 2/ 5 mM NaHC0 3/ 10 mM Hepes, pH 7.4), 
supplemented with 0.1% FCS for 2.5 h at 37°C. For insulin-stimulated 
lipid synthesis, cells were incubated with 1 /jM bovine insulin (Sigma; 
carrier: 0.005N HCI) for 45 min at 37 °C. Basal lipid synthesis was 
determined with carrier only. 14 C(U)-D-glucose (NEN Life Sciences) in a 
final activity of 1 //Ci/Well/mt in the presence of 5 mM glucose was added 
for 30 min at 37 °C. For the calculation of background radioactivity, 25 jjM 
cytochalasin B (Sigma) was used. All assays were performed in duplicate 
wells. To terminate the reaction, cells were washed 3 times with ice cold 
PBS, and lysed in 1 ml 0.1 N NaOH. Protein concentration of each well was 
assessed using the standard Biuret method (Protein assay reagent; 
Bio-Rad). Total lipids were separated from aqueous phase after overnight 
extraction in Insta-Fluor scintillation cocktail (Packard Bioscience) followed 
by scintillation counting. 

We show in this invention that cells overexpressing unnamed protein 
XP085058 are more effective in synthesising lipids from exogenous 
glucose. Consequently, the levels of insulin stimulated lipid synthesis of 
cells overexpressing unnamed protein XP085058 are also higher at day 
12 of adipogenesis when compared to control cells (FIGURE 8E). On the 
same day of differentiation, the lipid synthesis of CGI-82 overexpressing 
cells can not be stimulatd by insulin as with the control cells (FIGURE 8E). 
The increased level of lipid synthesis from exogenous glucose in cells 
overexpressing unnamed protein XP_085058 demonstrate an activation of 
enzymes involved in lipid synthesis in these cells as compared to control 
cells. 
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Transport and metabolism of free fatty acids during adipogenesis (FIGURE 
8G) 

During the terminal stage of adipogenesis (d12) cells were analysed for 
their ability to transport long chain fatty acid across the plasma membrane. 
A modified protocol to the method of Abumrad et al (1991)( Proc. Natl. 
Acad. Sci. USA, 1991: 88; 6008-12) for cellular transportation of fatty 
acid was established. In summary, cells were washed 3 times with PBS 
prior to serum starvation. This was followed by incubation in KRBH buffer 
supplemented with 0, 1 % FCS for 2.5h at 37°C. Uptake of exogenous free 
fatty acids was initiated by the addition of isotopic media containing non 
radioactive oleate and ( 3 H)oIeate (NEN Life Sciences) complexed to serum 
albumin in a final activity of 1 //Ci/Well/ml in the presence of 5 mM glucose 
for 30 min at room temperature (RT). For the calculation of passive 
diffusion (PD) in the absence of active transport (AT) across the plasma 
membrane 20 mM of phloretin in glucose free media (Sigma) was added for 
30 min at RT. All assays were performed in triplicate wells. To terminate 
the active transport 20 mM of phloretin in glucose free media was added 
to the cells. Cells were lysed in 1 ml 0.1 N NaOH and the protein 
concentration of each well were assessed using the standard Biuret 
method (Protein assay reagent; Bio-Rad). Esterified fatty acids were 
separated from free fatty acids using overnight extraction in Insta-Fluor 
scintillation cocktail (Packard Bioscience) followed by scintillation counting. 

. We found in this invention that transport of exogenous fatty acids across 
the plasma membrane of overexpressing cells overexpressing unnamed 
protein XP 085058 were significantly higher at day 12 of adipogenesis 
when compared to control cells (FIGURE 8C). We also show in this 
invention that esterification of fatty acids of CGI-82 overexpressing cells 
but not cells overexpressing unnamed protein XP085058 were 
significantly lower at day 12 of adipogenesis when compared to control 
cells (FIGURE 8D). 
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The profound increase in uptake of free fatty acids of cells overexpressing 
unnamed protein XP_085058 and the decrease of the free fatty acid 
esterification in CGI-82 overexpressing cells could be due to a direct action 
of the proteins of the invention or alternatively due to a role in the 
regulation of free fatty acid uptake and esterification. The proteins of the 
invention may for example play a role in converting regulatory inactive 
retinoids or steroids in regulatory activ hormone derivatives which then 
influence fatty acid metabolism directly or regulate the gene expression of 
fatty acid metabolic enzymes or transporters on the transcriptional level. 

Example 6: Generation and analysis of transgenic mice 

Generation of the transgenic animals 

Mouse cDNA was isolated from mouse brown adipose tissue (BAT) using 
standard protocols as known to those skilled in the art. The cDNA was 
amplified by RT-PCR and point mutations were introduced into the cDNA. 
The resulting mutated cDNA was cloned into a suitable transgenic 
expression vector. The transgene was microinjected into the male 
pronucleus of fertilized mouse embryos (preferably strain C57/BL6/CBA F1 
(Harlan Winkelmann). Injected embryos were transferred into 
pseudo-pregnant foster mice. Transgenic founders were detected by PCR 
analysis. Two independent transgenic mouse lines containing the construct 
were established and kept on a C57/BL6 background. Briefly, founder 
animals were backcrossed with C57/BL6 mice to generate F1 mice for 
analysis. Transgenic mice were continously bred onto the C57/BI6 
background. The expression of the proteins of the invention can be 
analyzed by taqman analysis as described above, and further analysis of 
the mice can be done as known to those skilled in the art. 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the 
described method and system of the invention will be apparent to those 
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skilled in the art without departing from the scope and spirit of the 
invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention 
as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the 
invention which are obvious to those skilled in molecular biology or related 
fields are intended to be within the scope of the following claims. 
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Claims 



5 
10 

2. 

20 

3. 

25 



A pharmaceutical composition comprising a nucleic acid molecule of 
the short-chain dehydrogenase (SCAD) gene family or a polypeptide 
encoded thereby or a fragment or a variant of said nucleic acid 
molecule or said polypeptide or an effector of a nucleic acid 
molecule of the SCAD gene family or said polypeptide, e.g. an 
antibody, an aptamer or another receptor recognizing a nucleic acid 
molecule of the SCAD gene family or said polypeptide encoded 
thereby, preferably together with pharmaceutical^ acceptable 
carriers, diluents and/or adjuvants. 

The composition of claim 1 , wherein the nucleic acid molecule is a 
vertebrate or insect SCAD nucleic acid, particularly a nucleic acid 
encoding a Drosophila protein (GadFly Accession Number CG3842), 
an unnamed protein (SEQ ID NO: 1 and 2), a human CGI-82 protein 
(SEQ ID NO: 3 and 4), or or a human PAN2 protein (GenBank 
Accession Number NP_065956 for the protein, NMJD20905 for the 
cDNA) or a fragment there of or a variant thereof and/or a nucleic 
acid complementary thereto. 

The composition of claim 1 or 2, wherein said nucleic acid molecule 

(a) hybridizes at 50°C in a solution containing 1 x SSC and 0. 1 % 
SDS to a nucleic acid molecule as defined in claim 2 and/or a 
nucleic acid molecule which is complementary thereto; 

(b) it is degenerate with respect to the nucleic acid molecule of 



(c) encodes a polypeptide which is at least 85%, preferably at 
least 90%, more preferably at least 95%, more preferably at 
least 98% and up to 99,6% identical to a SCAD polypeptide 
as defined in claim 2; 
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(d) differs from the nucleic acid molecule of (a) to (c) by mutation 
and wherein said mutation causes an alteration, deletion, 
duplication or premature stop in the encoded polypeptide. 

4. The composition of any one of claims 1-3, wherein the nucleic acid 
molecule is a DNA molecule, particularly a cDNA or a genomic DNA. 

5. The composition of any one of claims 1 -4, wherein said nucleic acid 
encodes a polypeptide contributing to regulating the energy 
homeostasis and/or the metabolism of triglycerides. 

6. The composition of any one of claims 1 -5, wherein said nucleic acid 
molecule is a recombinant nucleic acid molecule. 



7. The composition of any one of claims 1-6, wherein the nucleic acid 
molecule is a vector, particularly an expression vector. 

8. The composition of any one of claims 1-5, wherein the polypeptide 
is a recombinant polypeptide. 

9. The composition of claim 8, wherein said recombinant polypeptide is 
a fusion polypeptide. 

10. The composition of any one of claims 1-7, wherein said nucleic acid 
molecule is selected from hybridization probes, primers and 
anti-sense oligonucleotides. 

11. The composition of any one of claims 1-10 which is a diagnostic 
composition. 

12. The composition of any one of claims 1-10 which is a therapeutic 
composition. 
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5 

10 14. 

15 

15. 

20 
25 



The composition of any one of claims 1-12 for the manufacture of 
an agent for detecting and/or verifying, for the treatment, alleviation 
and/or prevention of an disorders, including metabolic diseases such 
as obesity and other body-weight regulation disorders as well as 
related disorders such as metabolic syndrome, eating disorder, 
cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis and/or gallstones 
and others, in cells, cell masses, organs and/or subjects. 

Use of a nucleic acid molecule of the SCAD gene family or a 
polypeptide encoded thereby or a fragment or a variant of said 
nucleic acid molecule or said polypeptide or an effector, e.g. an 
antibody, an aptamer or another receptor recognizing a nucleic acid 
molecule of the SCAD gene family or a polypeptide encoded thereby 
for controlling the function of a gene and/or a gene product which is 
influenced and/or modified by a SCAD homologous polypeptide. 

Use of the nucleic acid molecule of the SCAD gene family or a 
polypeptide encoded thereby or a fragment or a variant of said 
nucleic acid molecule or said polypeptide or an efector, e.g. an 
antibody, an aptamer or another receptor recognizing a nucleic acid 
molecule of the SCAD gene family or a polypeptide encoded thereby 
for identifying substances capable of interacting with a SCAD 
homologous polypeptide. 

A non-human transgenic animal exhibiting a modified expression of 
a SCAD homologous polypeptide. 



30 



17. 



The animal of claim 1 6, wherein the expression of the SCAD 
homologous polypeptide is increased and/or reduced. 
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1 8. A recombinant host cell exhibiting a modified expression of a SCAD 
homologous polypeptide. 

19. The cell of claim 18 which is a human cell. 

20. A method of identifying a (poly)peptide involved in the regulation of 
energy homeostasis and/or metabolism of triglycerides in a mammal 
comprising the steps of 

(a) contacting a collection of (poly)peptides with a SCAD 
homologous polypeptide or a fragment thereof under 
conditions that allow binding of said (poly)peptides; 

(b) removing (poly)peptides which do not bind and 

(c) identifying (poly)peptides that bind to said SCAD homologous 
polypeptide. 

21 . A method of screening for an agent which modulates the interaction 
of a SCAD homologous polypeptide with a binding target/agent, 
comprising the steps of 

(a) incubating a mixture comprising 

(aa) a SCAD homologous polypeptide, or a fragment 
thereof; 

(ab) a binding target/agent of said SCAD homologous 
polypeptide or fragment thereof; and 

(ac) a candidate agent 

under conditions whereby said SCAD polypeptide or fragment 
thereof specifically binds to said binding target/agent at a 
reference affinity; 

(b) detecting the binding affinity of said SCAD polypeptide or 
fragment thereof to said binding target to determine an 
(candidate) agent-biased affinity; and 

(c) determining a difference between (candidate) agent-biased 
affinity and the reference affinity. 
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22. A method of screening for an agent which modulates the activity of 
a SCAD homologous polypeptide, comprising the steps of 
(a) incubating a mixture comprising 

(aa) a SCAD homologous polypeptide or a fragment thereof; 



(ab) a candidate agent 

under conditions whereby said SCAD polypeptide fragment 
thereof has a reference activity; 

(b) detecting the activity of said SCAD polypeptide or fragment 
thereof to determine an (candidate) agent-biased activity; and 

(c) determining a difference between (candidat) agent-biased 
activity and reference activity. 

23. A method of producing a composition comprising the (poly)peptide 
identified by the method of claim 20 or the agent identified by the 
method of claim 21 or 22 with a pharmaceutical^ acceptable 
carrier, diluent and/or adjuvant. 

24. The method of claim 23 wherein said composition is a 
pharmaceutical composition for preventing, alleviating or treating of 
diseases and disorders, including metabolic diseases such as obesity 
and other body-weight regulation disorders as well as related 
disorders such as metabolic syndrome, eating disorder, cachexia, 
diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis and/or gallstones 
and other diseases and disorders. 

25. Use of a (poly)peptide as identified by the method of claim 20 or of 
an agent as identified by the method of claim .21 or 22 for the 
preparation of a pharmaceutical composition for the treatment, 
alleviation and/or prevention of of diseases and disorders, including 
metabolic diseases such as obesity and other body-weight regulation 



and 
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disorders as well as related disorders such as metabolic syndrome, 
eating disorder, cachexia, diabetes mellitus, hypertension, coronary 
heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, 
and/or gallstones and other diseases and disorders. 

26. Use of a nucleic acid molecule of the SCAD family or of a fragment 
thereof for the preparation of a non-human animal which over- or 
under-expresses the CG3842 homologous gene product. 

27. Kit comprising at least one of 

(a) a SCAD nucleic acid molecule or a -fragment thereof; 

(b) a vector comprising the nucleic acid of (a); 

(c) a host cell comprising the nucleic acid of (a) or the vector of 

.(b); 

(d) a polypeptide encoded by the nucleic acid of (a); 

(e) a fusion polypeptide encoded by the nucleic acid of (a); 

(f) an antibody, an aptamer or another receptor against the 
nucleic acid of (a) or the polypeptide of (d) or (e) and 

(g) an anti-sense oligonucleotide of the nucleic acid of (a). 
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FIGURE 2. Molecular organisation of the gene CG3842 (GadFly Accession 
Number) 
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FIGURE 3. BLASTP RESULTS FOR CG3842 

FIGURE 3A. Homology to human gene ref XMJJ85058, protein ref XP.085058.1 

Length = 316 

SSi-tuT- SS.K'Si.r^iS 70 - "6/317 «„«. Gaps = 13/317 (4%, 

Query: <5 ^IZ*--------^^ 96 

Sbj cf 2 LVTlStLTSFFSFLYMWAPSIROTFAGGVCRTHVQLPGKVVVITGROT « 

Query: « «-~«-«^^ 

sbjct . 62 X" 

sbjct . « SHSS^SS«JSS2^^ « 

Query: 217 ^^KHVSKFFG^^^ ™ 

Sbj ct= 241 S l.-^^ S ^^^<^^^^ S ^ " 6 

Query; 337 MQTADWLWRESEKLLGL 3S3 

+TA+ LW S +LLG+ 
Sbjct: 297 NKTAERLWNVSCELLGI 313 

FIGURE 3B. Homology to human gene ref NM_020905, protein ref NP_065856.1 

>reflNP 06S9S6.ll (NM 020905) PAN2 protein [Homo sapiens] 
Length » 336 

s IQ :"i" t ie 2 9 s t s^ura^" — - 20/319 (6%) 
54 ^cxoop^ — •srsEK"?? sets 107 

sbjcc . 17 ^Larrfvgprvq^ 76 

^ery.. 103 PGRCEAARLDIMDRSRHQ ^ 

sbjct . 77 ^K^lRRE^QAAECGPEP^SGVGHLIVRE^I^LP^CQEMLQEEPRL 136 

Query: 155 »™»?SSOT T. 

sbjct . 137 s;^^^^^^" 196 

Query: 215 — —S — "* 

sbjc , 197 £ DI ^^^ 255 

Query: 275 ^^^^^^^^^^1^ ^ 

Query- 335 RNMQT AD WLVTRE S E KLLG L 353 

+ A LW SB ++GL 
Sbjct: 316 MDESVARKLWDISEVMVGL 334 
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FIGURE 3C. Homology to human gene ref NM.016026, protein ref NP_057110.1 

Pairwise alignment of Drosophila CG3842 encoded protein (query) and ^ dro f e ^ 
short-chain dehydrogenase/reductase I; prostate short-chain dehydrogenase reductase I, CGI- 
82 protein [Homo sapiens] (subject) 

>ref|NP_0S7110.ll (MM.016026) CGI-82 protein; likely ortholog of mouse cell line 
MC/9 IU derived transcript 1 [Homo sapxena] 
reflXP 031073. 1| <XM 031073) CGI-82 protein IHomo aapxens] 
=b|iSl4077.l|AFlS187o 1 (AF151840) CGI-82 protein (Homo aapxens] 

g RS l .1 AAH00112-(BC000112, CGI-82 »»~ r "££ , £ iwia (Homo sapiens, 

Ite AAK72049 1 AF39S068 1 (AF395068) HCV core-bxndxng protexn HCBP12 IHomo sap 
g|^ri727.l|A^11727-(BC011727) Similar to CGI-82 protein (Homo aapxens] 
Length = 318 

Query: 43 x---^ «• 

Sbjct: 8 LLLL^PFIiYMAAPQIRKMLSSGVCTSTVQLPGKVVVVT 67 

Sbjct: 68 ^^C^VEKGEbVAKEIQTTTGNQQVLVRKLDLSDTKSIRAFAKGFLAEEKHLHVIiINN 127 

Query: 161 AGVMACPRTLTADGFEQQFGVllHI/3HFLLTNLI<IiDRLKHSSPSRI^ 220 

AGVM CP ♦ TADGFE OVHHIX3HFLLT+LLL++IiK S+PSRIV VSS AH GRI+ 
Sbjct: 128 AG^CPYS.CTA^FE^IGVNHI^FI.I.TK^LEKLKESAPSRIVNVSSI^^RIHF 187 

Query: 221 SDUUHttllYSKFre^^ 280 
^.T EK Y+ AY SKLANILFT +L.+ LK +GVT HPG V++E+ RH S 

Sbjct: 188 H^EKFYNAGL-AYCHSKI^ILFT^ 246 

Query: 281 GWMKTALQKGSLYFFKTPKAGAQTQLRI^PQLEGSTGGYYSDCMRWPLFPWV^QTA 340 
MM +F KTP+ GAQT L AL LE +G ++SDC + RN A 

Sbjct: 247 rZwLFS- - -FFIKTPQQGAQTSIjHCAIjTEGLEILSGNHFSDCHVAWVSAQARNETIA 302 

Query: 341 DWLWRE S E KLLG L P 354 

LW S LLGLP 
Sbjct: 303 RRLWDVSCDLLGLP 316 
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FIGURE 4. CLUSTAL X (1.81) multiple sequence alignment 



CGI -82 
XP_085058 
cg3842 
PAN2 



CGI -82 
XP_085058 
cg3842 
PAN 2 



CGI -82 
XP_085058 
cg3842 
PAN2 



CGI -82 
XPJ>85058 
cg3842 
PAN2 



PGKVVVVTGANTGI GKETAKELAQRGARVYLACRDVE KGELVAKE I QTTTGNQ - 

PGKVW I TGANTGIGKETARELASRGARVYI ACRDVLKGES AAS E I RVDTKNS 

». . .**.* * : * : *.*: *.*:.**» : * • = : : 
qVLVRKLDLSDTKSIRAFAKGFIiAEEK-HLHVIjINNAGVMMCPYS-KTADGFEM 

qS^lsdtksirafaegfi^ek-qi^ilinnagvwmcpys-ctadgfet 

Slf^tldlgsi^sviwfverfkaees-mjjilinnagvmacprt-^feq 

... * *** . *:* * *« :«.:******:: 

HIGVNHLGHFLIjTHLLLEKLKESAPSRIVNVSSIiAHHIiGRIHFHNI^GEKFYNAGLi-AYC 

^H\^TOjG^FIJiTOTiLDRI<KHSS PSRI VWSSAAHLFGRINREDtMSEKNYSKFFGAYS 
;.*.**:****** *** ** ::*:*:* :**:** 

FMRWMWWLFS 



HSKLANI LFTQELARRLKGSGVTTYSVHPGTVQSELVRIiSS - 
.*****:*** :*: *..:.**. *** * I;: * : 



CGI -82 
XP_085058 
cg3842 
PAN 2 



-FFIKTPQQGAQTSLHCALTEGLEILSGNHFSDCHVA 
- PFVKTAREGAQTSLHCAIAEGLEPLSGKYFSDCKRT 
LYFFKTPKAGAQTQLRLALDPQLEGSTGGYYSDCMRW 
WAFFKTPVEGAQTSIYLASSPEVEGVSGRYFGDCKEE 
. **** j * :: :* ::.** 



WO 03/074078 



'CT/EP03/02363 



6/22 



Figure 5. Sequences of the human proteins of the invention 

Figure 5A. Nucleic acid sequence of human unnamed protein 
XP 085058 (SEQ TD NO:l) 

. * - aotottctta aqgactctct gcttcctgga caagctcaag ctaaggacta 
A 2££. SESE --age, c tS;S a ^c^c 
m tcagagtcag gctgctgctc agcacccagg aegg^gagga « ^ J ggagcagc 
181 gecaagaget 99agccagac caggaac eg g * gtcaccttgg gactgctcac 
241 agcaaaagca acagcagcta ca 9 aa 9"99 « g g a agga agttct ttgctggtgg 
301 ctccttcttc tcgttcctgt a»tgg»gc g ?| at ? actg gcg ccaacac 

361 agtgtgtaga acaaatgtgc a 9°"cctgg »899S gga gcccgag tetatattge 
42 l gggcattggc -ggagaegg -agagagct ^ atacaaagaa 
481 ctgeagagat 9tactgaagg gggagt g g acc aaatc tatcc gagecnttge 
541 ctcccaggtg ctggtgcgga aattggacct * y at caacaatg egggagtaat 

601 tgagggcttt ctggcagagg aaaa 9' a 9^ =««ttctg atca g tcaaccacct 
661 gatgtgtcca tattccaaga cagctgatgg ="tgaaacc ca ggg g 
72 1 gggccacttc ctcctcacct jeetjetoet ^geggcta -ggt^^g acgacctcca 
781 ggtggttaat ^gt-tegg ^acca cattgg^g agcaagctgg ccaatgtgct 
841 gagegagaag cgctacagca 9999^ * aa acaccggg gtcaccacct aegcagtgea 
901 ttttactcgt Sagctggcca ^aggctcca aggcaceggg 9 ^ tgctctggcg 
961 cccaggcgtc gtccgctctg a 9«99" g * cagaccagcc tgcactgcgc 

1021 gctcttctcc ccctttgtca agaeggcacg M ag j? gacCg ca agaggacctg 
1081 cctggctgag ggcctggagc ccctgagtgg a ctatggaatg tcagctgtga 

1141 ggtgtctcca agggeccgaa ataacaaaac a 9^gagcgc agg cccaatc 
1201 gct?ccagga atccggtggg agtagctggt 99aagagctg "9^"* f 
1261 catgecataa tgaacaggga ccaaggagaa SSccaaccct aaagg g ac 
1321 agctggtgct gcgaatcctg ectgc^ ^tgagSgt ggacaccS t agagtgttct 
1381 acctgacact cttgtgagac tggcttrtgg c«W^ aggactg ggc agatcccagg 
1441 tctctaagac ctggaaagtc a 9" a ""= ^Wggglc agttccctca tcagcaccag 
1501 ctgggcatgg gggtggcaga agageccgag aaa "999" J at acactcca 

55 =S3S 5=3= === £ S=3= =S= 

1801 agagaaaata aagactgeat tcatct 

Figure 5B. Amino acid sequence of human unnamed protein XP_085058 
(SEQ ID NO:2) 

l .ivelgllts ffsfly^ap sir^aggv -itgantg i = la 

61 srgarvyiac rdvlkgesaa seirvdtkns rikvsaparv vnvssvahhi 

121 ilinnagvmm cpysktadgf ethlgvnhlg MIJ^JJJ ^tyavhp gwrselvrh 

Si SKSE5 2332 SSSSi y£™ 

301 erlwnvscel lgirwe 
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Figure 5C. Nucleic acid sequence of human protein CGI-82 (SEQ ID 
NO:3) 

X gctggag ca t ccc g ctc Cgg t gcc g ct g ca googgejjjg 
61 gctgttgctc ctccttctgc ccttccttct 9"«Wt ggga ttgtggtcac 
121 gctgtccagt agcSaagag Sggctcagl g a gg a g ctc g 

181 a gg a g ctaat acag g tatcg ggaaggaga « 3 atqqccaaag a g atcca g ac 
241 agtatattta gcttgccggg atgtggaaaa fla999~»9 f tg * c tgata cLagtctat 
301 cacgacaggg aaccagcagg ^gtgcg JJ^g^ ctccacgttt t ga t caacaa 
361 tcgagctttt getw "tacSgaa glcagclgat ggctttgaga tgcacatagg 
421 tgca gg agt g at g at g tgtc °3"ctcgaa g g g agagaaa c taaaggaatc 
481 agccaaccac "gggtcact "ctcctaac J cac I tgggaa ggatccactt 

541 agccccatca aggaca g taa a ^gtcttc cc 9 gcct act g tc aca g caa g ct 

601 ccataacctg cagggcgaga ^attctacaa ^ca^cctg J gcg ttacgac 
661 agccaacatc ctcttcaccc aggaactggc ccggagacta aa gg^j, cttfccatgag 
72 l gtattctgta caccctjgca cagt'caatc ^aactgg^ cccagaccag 
781 at gg at g t gg tggcttttct cccccccc » qggaatcatt tca g t g act g 

841 cctgcactgc eccttaacag -ggtcttga ^taagt ggctgtggga 
901 tcatgtg g ca t ggg t g tct g tccaa 9«cg ca g a cagtgccag ttg gacccaa 
961 cgtcagttgt gacctgctgg Stgtcaaaa ?gaf tctcct ccaaggtttt 

1021 gagaagaccg cagcagacta ""9^" ="gcc g ttgg tgtC ca g tta 

1081 caaaaccttc agcacaaaga jagcaaaacc "ccagcctt g g ^ 
1141 aaactcagtg tactgccaga "cgtctaaa iaagaccctc atatgacctg 

1201 tgttactgcc agagttacta sagatrtcat ^99^ | aa ? ctaagc tatagcaggg 
1261 cacagctcat tttccttctg Lagttccto ccaaccaacc 

1321 atgatttatg caaatttgaa c"gcttctt ^^ a = aa & aca ? gaataa caaagactgg 
1381 agtcttcact tcaagagggc cacactgcaa ^cagctta ac 9 tcaaga ccag 
1441 ctcaggagca gggcttgccc a 99catggtg gatcaccgga ?. g t a tatc?t tgtgtgtctt 
1501 cctggccaac atggtgaaac cccacctcta =" aaaa ^| aaacagccac aat aatcaga 
1561 cctgtttatg tgtgccaagg -^f^ aatgc ^ gcaa aggg aagcag 

1621 gatggagcaa «cc*9t9°" gagcacatgg gtagcaggga agaagtaaaa 

1681 atcctgtata tgtt gg taac tacccacc aa gage gy g ggactagt t aa gg attaac 
1741 aaagagaagg agaatactgg aa ^ aa ^ £5g2E Igatattaaa tatgetaaca 
1801 ta g cccttta a gg attaac t a 3 ttaa 99 a ^ a ^"^ atg agg tcttaac aaaaacca g t 
1861 tagctatgga ggaattgagg 9caa g caccc ^IZllltl aSacaaaca aacaaaaaaa 
1921 gtggcaaaaa aaaaaaaaaa aaaaaaaaaa "^Si tggtaattat ggtca.ttta 
1981 acaattcttc attcacjaaaa a «atctta g 9gact g at |9 tg t g ccaaaa 
2041 ataatatttt gggg catttc "tacattgt ^"gacaaga tt 9 agtctaa gga 
2101 ttttgtattt tatttggaga cttcttatca aaa 9taat g c tgee gg ctcctg 
2161 attagtagc. — cac ttgtttggag tjtg-tto taaaag^ S 
2221 gaacgacaat tatattttaa ""9^999 3* a aC cattaagc tatatgttta 
2281 tctgatactt gtaaattaat <*"tattgc ^ttttg ac ga ataaatgaat 

2341 gaaatggtca ttttacggaa aaa " a 9 aaa ££££ taa Lattc t ttttgatft 
2 2 4 46i S££S ™c a ag SSSc aagaa.ca.a cagtcaaaaa 
2521 aaaaaaaaaa aaaaaaaa 

Figure 5D. Amino acid sequence of human protein CGI-82 

(SEQIDNO:4) ovctstvqlp gkvwvtgan tgigketake 

A SSSii SSS2S SSSSS 5»S L«-u- —i— 
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121 lhvlinnagv mmcpysktad gfemhigvnh 
181 hlgrihfhnl qgekfynagl aychsklani 
241 rhssfmrwmw wlfsffiktp qqgaqtslhc 
301 iarrlwdvsc dllglpid 



lghfllthll leklkesaps rivnvsslah 
lftqelarrl kgsgvttysv hpgtvqselv 
altegleils gnhfsdchva wvsvqarnet 
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